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uses and a review of most of the existing literature on charts and 
graphs > 35 references listed) are provided* The literature review is 
used to develop the analytic scheme/theory and to justify the methods 
chosen by the research team* A psychological approach is taken 
because of recent advances in cognitive science which allow a 
modeling of visual interpretation as something more than a simple 
recording system* The authors indicated that this is tha first 
attempt at a comprehensive application of applied cognitive science, 
proving the usefulness of the knowledge being built up, supporting 
cognitive science in the sense of building a technology on it, and 
allowing deeper insights into human products and how to make them 
better* One hundred and thirty figures and tables are attached* 
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CHAPTER 1: Introduction 



It is often said that a picture is worth a thousand words* But this is 
not always true; many pictures are not even worth a dozen words* The worst 
offenders may be charts and graphs, pictures that are intended to convey infor- 
mation more effectively than could be done using words and numbers* But as 
anyone who has even glanced through the major national news magazines knows, 
charts and graphs often fall woefully short of this goal* This book is about 
the reasons why charts and graphs are all too often ineffective, uninterpre- 
table or semi-interpretable pastiches at best serving to make a page visually 
interesting* The other side of this coin is, of course, the ways in which 
charts and graphs can be a*ade to be effe^'ive, and much of this book focuses on 
this topic* 

Consider Figures 1*1 and 1*2, which appeared in Fortune magazine and the 
American Scientist , respectively* what is wrong with Figure 1*1? Can you 
understand it clearly? Most people are qaick to notice that the colors used to 
draw the functions are too similar, and most people are confused by the taper- 
ing shape* Now, what about Figure 1.2? This one is so awry that most people 
have difficulty simply discovering what the graph is about* But why? We could 
hazard guesses, but this clearly is not the best way to proceed* What we need 
is a systematic and well-motivated way of diagnosing the problems with specific 
displays* At the end of Chapter 5 we will return to Figures 1*1 and 1*2, and 
see what more can be said about their shortcomings* 



INSERT FIGURES i*1 AND 1.2 HERE 



This book describes the results of an extended research program on how 




charts and graphs convey information* This program has t*jo major foci which f 
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are played upon in the ambiguity of the title* By "understanding charts and 
graphs** we refer not only to the scientist's analysis of charts and graphs, but 
to the process whereby a reader comprehends them* Our first aim is to develop 
a scheme for describing analyzing the informat conveyed in charts or 
graphs* Iftis scheme is designed bo lay bare the particular problems inherent 
in a display, if any* Iftis analytic scheme is focused on the chart or graph as 
an object in its own right, and its properties are described in^terms of how 
the set of lines and marks on the page function as a complex set of symbols 
embodying information about objects or events in the world* Ifte scheme is 
oriented around a set of principles that must be obeyed if a display is to be 
readily interpretable* 

The second focus of the research program has been to develop a psychologi- 
cal theory of the knowledge in the head of the reader and of the mental events 
that occur when he or she attempts to read a graphic display* This theory, in 
part, provides the justification for the way we analyze charts and graphs, in 
that our analysis is supposed to tap the features of charts and graphs that 
make them relatively easy or difficult for a human reader to comprehend* Thus 
there is, in fact, an interplay between the two foci of the program, tho anal- 

ytic scheme and the psychological theory; the second provides the backdrop for 

* 

the first* 

The research program we describe here differs from all other work on 
charts' and graphs in two important respects* First, it is comprehensive* we 1 
consider charts and graphs at multiple levels of description, from lines on a 
page to abstract mathematical symbol structures to concepts in a person's head, 
and we consider charts and graphs intended for a wide range of different uses* 
In addition, in the course of developing our analytic scheme and theory, we 
review most of the existing literature on charts and graphs and how people 



comprehend them* This review is not included solely for purposes of complete- 
ness, however; rather, we use findings in the literature to help us develop 
both the analytic scheme and the theory, partly by using the findings to Justi- 
fy the way we have chosen to procede as opposed to alternatives const . "ed by 
others* Second, our system is firmly rooted in concepts developed in the study 
of perception and cognition* As noted above, even our analytic scheme is psy- 
chologically-oriented, and is intended to reveal the ways in which a given 
chart or graph is difficult for a person to interpret* 
Why a psychological approach? 

The psychology of the graph reader is a running theme throughout this 
book, and our empnasis on it is in fact the raison d'etre of the entire 
research project on which the book is based* To a psychologist, the worth of 
this approach is obvious, and we hope the fruits of our research as presented 
here will lead the nonpsychologists among our readers to a similar conclusion* 
But until now, few have explored the relations between the design of good 
graphs and the psychology cf the people who must read them* A search through 
the psychological literature of the last century turns up only a handful of 
studies on gi'aph reading, and the "how-to" guides for graph designers often do 
not seem to make ev^n the slightest concession to the fact that the intended 
audience for graphs consists of humans rather than robots or Martians* 

The reason for this failure of minds to meet, we feel, is fairly simple* 
In everyday life, it is natural to think of our eyes as simple recording sys- 
tems, registering the world as it is* But even a moment's reflection (not to 
mention a century ?f laboratory research) can show that this analogy can be 
misleading* Consider the following examples* Cereal manufacturers can design 
boxes that look tvvce as big as their competitor's, but do not contain twice as 
many cornflakes* We readily notice a gain or loss of S pounds on a slim per- 
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son, but are oblivious to a weight change on an obese person unless it is many 
times more extreme* Rows of reflectors on a dim highway, or formations of 
geese flying overhead, stand out perceptually as cohesive solid objects; ani- 
mals or airplanes ■?> blotchy camouflage are not seen as obje -it all* 
Naturally, these biases built int*> our eyes and brains will not disappear when 
we look at graphs instea*. of birds, and obviously an effective graph designer 
will do best by being aware of these biases* 

Incidentally, our knowledge of human vision has underscored not only its 
deficiencies in comparison with raechanicaT optical systems but also in many 
cases, its superiority, and in others, sheer differenjes in operation* Check- 
reading machines can record those odd-looking numerals at the bottom of checks, 
but unlike our eyes, these mechanical visual systems cannot make head nor tail 
of the names and dates printed at the top* People don't think shadows are 
parts of objects or ttvt a tree lined up with a person is attached to him, but 
even the cleverest computer vision systems are prone to make such mistakes* 
And computers in general are indifferent as to whether a given set of numbers 
enter their data banks as a pattern of black marks on a page, a pattern of 
beeps over a telephone line, a pattern of holes punched on a card, or a pattern 
of movement of a joystick* But humans seem to prefer their numbers in graphic 
form, even though lists of numbers (or for that matter precise patterns of 
rising and falling tones) can contain identical information* Iftese peculiar 
biases of ours, taken together, can shed light on the structure of our visual 
systems, a structure that makes us unique among optical information processing 
devices* The details of this structure, in turn, determine the ease or diffi- 
culty with which people with various sorts of training will extract various 
sorts of information from various sorts of graphs* 




Applied Cognitive Science 

This book is more than a psychological analysis of charts and graphs or a 
Psychological theory of how they are comprehended/ although it is both of these 
■ angs* our approach is of a very recent vintage/ expanding beyond tradi- 
tional boundaries of the field* In fact/ this book represents the first com- 
prehensive program of its kind in the emerging discipline of "cognitive 
science* 1 ' Cognitive science draws theory/ methodology/ and conceptual tools 
from linguistics/ philosophy/ and computer science/ in addition to psychology* 
We have put to use many ideas and techniques from this broader discipline in 
developing our analytic scheme and processing theory* The core of our analytic 
scheme is drawn frora basic distinctions in linguistics and some ideas developed 
in philosopny/ and the backbone of our theory rests on concepts developed in 
computer science* Further/ in addition to drawing on the psychological litera- 
ture to buttress our empirical claims we rely heavily on methodologies 
developed in linguistics to test specific aspects of our ideas* 

r 

Thus/ this book demonstrates how one can "cash in" on the abstract ideas 
that have been percolating in cognitive science* Demonstrations of the applic- 
ability of a body of knowledge are useful for a number of reasons* First, the 
value of obtaining the abstract knowledge is underlined if it can be put to use 
(especially if the uses are unexpected clearly spinning off of the abstract 
knowledge per se and not a special effort to discover something useful) * 
Nobody questions the value of studying physics/ if only because of the bounti- 
ful harvest of technology from the pure research* Second, the mere fact that a 
technology can be built upon the fruits of pure research is another kind of 
evidence supporting the theories and general approach that guided the research* 
That is/ one metric of evaluation of a theory-is how well that theory not only 
explains old data and predicts new data, but how well it le< ds to the produc 
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tion of usefui phenomena or insights* And, of course/ there is a third reason 
why finding applications of cognitive science in particular is a good idea; it 
promises to give us deeper insights into human products — such as charts and 
graphs—and how to make 1* n better* That is* we hope bo use our the and 
general approach to research to tailor the things we use in everyday life such 
that they are maximally compatible with how we think^and what we are* This 
book is one demonstration of how such an enterprise can proceed* 

BACKGROUND 

Even a casual pe-usal of the literature immediately convinces one that 
there is a real need for research on charts and graphs/ and that there is a 
real need for a systematic approach to the topic* Research on charts and 
graphs is* in a word, scanty* Psychological jibs tracts lists about a dozen 
studies conducted in the last quarter-century/ many published in esoteric 
sources* The available literature falls into three classes; **How to H books 
for graph makers, graphic tools for statisticians, and laboratory research 
which compares graphs to other Media and investigates the comprehension of 
charts in graphs in general* We will consider this last category when relevant 
in the remainder of this book, but let us get a sense of the general run of the 
field by examining tbe other two classes now* 

The largest category of treatments of charts and graphs is clearly the 
*How to" books (e*g*, Brinton, 1 91 9j Carrol, i960; Haskell/ 1920; Luts, 1949/ 
Rogers* 1961)* These books typically divide graphs into different categories 
(e*g*, line graphs/ bar graphs/ pie graphs/ and pictograms), provide pointers 
on how to construct them (based primarily on the author's experience), and 
of f<ij>^ few rules of thumb as to which graphs should be chosen to represent 



which types of information (e.g.* ^trends should be conveyed by line graphs, 
and proportions by pie graphs") * They also offer suggestions on improving the 
clarity of graphs ("e.g., "if overlapping lines on a graph are cluttered togeth- 
er and hard to differentiate, expand the vertical scale, draw the lines i* 
different colors, or place the lines on separate grids")* 

Although "How to" books may serve well as basic primers, their usefulness 
to, the researcher is limited for the following reasons. First, although the 
conventional taxonomies of graphs reduce the variety of graphs to a more man- 
ageable number, they do not specify graphs in terms of the relevant psychologi- 
cal dimensions, a prerequisite to predicting how easily the graphs will be 
understood* Second, the rules of thumb on the visual clarity of a graph and 
its appropriateness for representing a given type of information are not based 
"on empirical studies of graph comprehension* Rather, they are based on the 
intuitions of the author, which may be unrepresentative or contaminated by his 
or her professional prejudices* Furthermore, concensus among many authors over 
a set of rules of thumb may not be an adequate indicator of their soundness* 
These "How to M books follow **ch other's presentations closely/ and they may 
simply be Resenting an arbitrary, institutionalized conventional wisdom* 
Third, the rules of thumb describe comprehensibility in vague, global terms* 
The problem with this is that while readers may report that graphs constructed 
according to these dictates are easy to understand, the information they get 
from the graphs may be distorted in subtle but important ways — such as the 
reader seeing merely an increasing trend when the Graph should specifically be 
depicting an eaqponentially increasing trend* Finally, the rules of thumb do 
not illuminate in any obvious way the cognitive processes involved in graph 
comprehension, whicii must be understood if psychologically-motivated principles 
of graph construction are ever to be developed* 
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Nevertheless, the "How to" books do have some uses* First, they provide 
large and varied samples of graphs giving us the opportunity to test the power 
of any analytic scheme (as will be discussed in detail in Chapter 5)* Seconds 
the rule. thumb may hav heuristic value — if one rule seems particularly 
plausible, it can direct attention to one aspect of a graph and seme operating 
principle of a cognitive component, suggesting an area of potential research* 
Finally, once a solid theory has been developed, a test of its adequacy can be 
made by returning to the rules of thumb, and noting how well the theory can 
explain the effectiveness (or lack thereof) of these rules* 

A second source of insights on the use of graphic techniques comes from 
statisticians (e.g. Barnes, Pearson, and Reiss, 1955; Duntemann, 1967* Mullet, 
1972* Tukey, 1971; Wainer, 1974* also see the "Teacher's Corner" feature of The 
American Statistician )* The statisticians offer ways to graph data that make 
certain properties of the data salient. The psychological hypothesis under- 
lying these graphing tools is that a statistical concept or parameter can be 
most easily grasped if it is displayed as a (preferably unidimensional) visual 
parameter like length or size. In addition, there are sometimes more specific 
hypotheses — for example, consider Tukey's (1971) suggestion that the human 
visual system is better adapted to judging the degree and type of scatter about 
a straight line than about a curve. This notion led Tukey (1971) to suggest 
that when one wishes to depict goodness-of-f i t of an observed to a theoretical 
distribution, one should use a fl hanging histogram" instead of a conventional 
one. In a hanging histogram, one end of each histobar is anchored at the line 
representing the theoretical distribution function and the other end "hangs" 

down toward the abscissa (see Figure 1.3). This allows one to assess all of 

j 

the histobars relative to the same horizontal line — a task thought to be easier 
than gauging the scatter of ttw upper ends of the histobars about the curved 



line representing the theoretical distribution in a conventional histogram. 

INSERT FIGURE 1*3 HERE 

Unfortunately, these suggestions are not much more valuable for the pres- 
ent purposes than are those of the "How to'* ->ks* Their effeo* iveness is 
unknown, they are seldom tested empirically, and in one instance in which a 
suggestion was tested (Tukey's hanging histogram proposal, in fact)* no advan-* 
tage over conventional techniques was found (Wainer, 1974)* In anv case, since 
the techniques are designed for highly specific types of information, their 
relevance to the cognitive processes involved in comprehending graphs is un-\ 
clear* The usefulness of the statistical tools, then, is similar to that of 
the "How to" books* They have a heuristic function, leading one to test their 
predictions (especially their specific predictions about visual classification 
processes), and to search foe explanations for those predictions that are 
confirmed* * 

Thus, we are forced to rely on the empirical studies of graphic comprehen- 
sion, which attempt to collect data supporting' a given claim* Without such 
data we simply have no idea which notions should be taken seriously and which 
merely seemed lite good ideas at the time* But first need a way of making 
sense o£ the data, a way of structuring the issues and investigations that will 
allow us to draw out the practical impact of research findings* Thus, in the 
following chapter we develop a conceptual framework for characterizing charts 
and graphs* This framework is then used in the two following chapters, in 
which we review the empirical findings that bear on each of our operating prin- 
ciples* We will cast a somewhat critical eye on these findings, attempting to 
cull out those which are so methodologically flawed as to be of dubious value* 
In point of fact, most of the studies of charts and graphs in the literature 
are not much more useful than the "How to r books in the statisticians propo- 



sals; they too should best be regarded more as a heuristic source of sugges- 
tions than as the genuine foundation for a body of: research* Many of these 
studies confound perception and memory; all simply tally errors rather than 
scaling perceived values of a graphed variable psychophysical^; and most of 
the- studies perf^ ,aed prior to the 1960's exhibit serious flaws in their design 
(e*g*, failing to counterbalance order of presentation of conditions, using 
only a single set of data to be presented to the subject in each format, not, 
informing the subject of what should be attended to in the graph, and providing 
ambiguous instructions)* However, there are ample findings in the mainstream 
psychological literature that do bear directly on the perception and comprehen- 
sion of charts and graphs^ although they have not previously been regarded in 
this way* We will consider these findings and then implications in conjunction 
with new data we will provide along the way* 
Using this book 

This book can be read in two ways* Each chapter develops some ideas per- 
taining to graph communication in some depth, often reviewing a sizeable body 
of literature* We hope that the reader interested in actually studying charts 
and graphs or in further developing a scheme like ours will find these techni- 
cal details important* For the reader interested in simply obtaining some 
practical guidelines to designing better charts and graphs, it might suffice 
simply to skim the chapters and focus on the conclusions* However, it is our 
hope that graph designers will become increasingly aware of the psychological 
makeup of the audience for their creations, and thus we would encourage graph 
designers to try to absorb the psychological rationales for the guidelines in 
addition to the guidelines themselves* 
Overview of the book 

This book has three distinct sections* The following chapter develops the 
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analytic scheme, an integral part of which are a set of "operating principles" 
which must be adhered to if a chart or graph is to communicate effectively* 
The principles themselves are developed in detail in the second major section 
of the book* The initial principles stem from well-e' x li' facts about the 
human perceptual system such as those mentioned earlier in the introduction, 
and thus there is a substantial body of laboratory research that pertains to 
them* This literature is reviewed and the morals. for the chart and graph maker 
are distilled* In addition, there are principles that do not emerge from th* 
study of basic perceptual processes, but are revealed only when we examine how 
the eye and mind interpret charts oc graphs per se* These principles are de- 
rived in part by using an inductive methodology that has proved highly success- 
ful in the study of human language* We gathered a large representative sample 
of charts and graphs, assessed how easy or difficult each one appeared to us, 
and treated these judgements as empirical data about graph readers (in this 
case, ourselves) in need of explanation* As a first step towards that explana- 
tion, we formulated the smallest set of principles we could find that concisely 
categorised the problems we experienced in interpreting the graphs in our sam^ 
pie* The data we Marshall in support of these principles is akin to those used 
by linguists concerned with developing grammars* Such grammars are developed 
and tested by considering which strings of words form proper sentences and 
which do not, and why* instead of sentences, however, we construct minimal- 
difference pairs of displays, with the difference between them reflecting a 
difference in the operation of a specific principle* if one display is clearly 
inferior to the other, we reason, and this inferiority is localised to that 
aspect affected by the principle in question, then this provides support for 
the psychological validity of the principle* in most cases the inferiority of 
one member 01 a Pair is overwhelmingly obvious, and hence the reader's intu- 
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itiona can be treated as a kind of data in their own right* But we have taken 
the precaution of collecting data from naive subjects to butress our claims 
about the effects of violating our operating principles* In the final chapter 
of this i " fl le section, we summarize the details of a survey of a representa- 
tive sample of charts and graphs from a wide variety of sources* giving thr 
reader some sense of the most common sins committed by graphic artists* We 
also include here the short form of our analytic scheme* which can be used by 
anyone to evaluate a chart or graph* This scheme has been validated and 
assessed for reliability, as described in-this chapter* The first chapter of 
the final section of the book consists of a description of new psychological 
theory of graph comprehension; a theory of what we know when we know how to 
read a graph/ how we use that knowledge when we read a particular graph, and 
how we attain that knowledge to begin with* The theory uses a large body of 
research on perception, cognition, and memory to integrate the conclusions of 
the previous chapters a..d to generate predictions for future research* Next we 
offer a set of gu* Klines — based on both the theory and the analytic scheme — 
for constructing charts or graphs* And in the final chapter we show how the 
present project can be generalized to the design and use of maps* diagrams and 
other sorts of visual displays* 
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CHAPTER 2; CHARACTERIZING EFFECTIVE GRAPHIC COMMUNICATION 



Everyone has had the experience of opening a well-known national news 
magazine and puzzling over a chart or gr ^* , trying to figure out what it is 
about and what it is supposed to be telling :he reader* Often one can point to 
some aspect of the offending bit of graphics and say that those lines are too 
close together or that mislabeled axis is the root of the problem* But oi.ten 
one is not so sure exactly what is wrong and unable to tell the artist how to 
improve his or her work* In this chapter we develop a scheme for describing a 
chart or graph that has led to a systematic way of characterizing what is 
right, and wrong* about any given chart or graph* Because of the way the 
scheme was designed, it should be easily u^ed to describe any unambiguous chart 
or graph in a straightforward way* When it cannot be easily applied, this is 
like a red flag waving, telling us that there is something wrong* We have 
developed — and tested, as will be described later--a set of principles that 
should be adhered to if a chart or graph is to be effective, and usually one of 
these principles (bo be described shortly) has been violated when the scheme 
cannot be used easily* 
Types of Visual Displays 

There are numerous and varied ways in which people illustrate ideas or 
concepts* Cartoons, for example, can illustrate the artist's impressions by 
subtle variations of the thickness of a line (making a politician appear to 
have a heavy, caveman-like brow)* Similarly, m*C* Escher's bizarre visions can 
force the viewer to see things in a new light* But these artistic uses of 
visual media are not the topic of this book* We are concerned with how quanti- 
tative information and relations among qualities are communicated graphically* 
These displays necessarily use* symbols- -marks that are interpreted in accor- 
dance with convention* There are common types of "symbolic 1 ' displays, which 
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differ in terns of what information is communicated and how information is 
communicated * 

Graphs are the most constrained form, with two scales always being 
required and values or sets ot values being associated via a "paire^l with" 
relation that is always symmetrical* 

Chart s arc less constrained because the entities being related are less 
constrained (they can be depictions* names, or numbers) and there is a wide 
variety of possible relations (practically anything)* Nevertheless, charts 
have an internal structure, where entities must be visibly connected to other 
entities by lines that serve as links* These links can be labeled or 
unlabeled, directed or undirected, and need not simply pair entities* 

Maps are unlike charts and graphs in that they are not entirely symbolic; 
a part of a map corresponds nonarbitrarily to a part of a territory that is 
; ctured* The internal relations among parts of a map are determined by the 
internal relations of what is pictured* However, maps usually include a sym- 
bolic component (e*g*, different colors representing different population), and 
labels are paired with locations by superimposing them* 

Diagrams are schematic pictures of objects or entities* These can be 
picturablo objects, such as parts of a machine, or abstract concepts, such as 
forces acting on the parts* A diagram is symbolic in that special symbols 
(e*gw cross-hatching to illustrate curvature) are used; a photograph is not 
symbolic because no "conver^iorial* means of representation are exploited* 
Unlike charts and graph*, the parts of a diagram correspond to parts of some 
actual object or entity; and unlike maps, parts of diagrams do not represent 
locations of a territory* 

Finally, tables are the least constrained and most general of the lot* A 
table can have words, numbers, or pictures* They can be arranged any way the 
de^'^er wants (providing, of course, that the arrangement allows the reader to 
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extract the necessary information — but we *ill get to this shortly)* Instead j 
of numbers representing the population of each state, the illustrator can pre- 
sent a pap-like drawing where the size of the state represents the number* 
Note that appearances notwithstanding, such an illustration is not really a 
maP: it uses the shapes of the st^ as labels; the actual spatial relations 
among the states is irrelevant for the purpose at hand* The states could be 
broken into four main regions , north, south, east and western regions, if it so 
suited the artist — or even listed in a column in alphabetic order* Tables, 
unlike charts and graphs, either have no internal organization or are organized 
globally* A set of balloons whose size corresponds to the amount a politician 
has talked a given day uses size as a numerical value, and the order of the 
balloons is irrelevant* In some cases, however, the relation to row and column 
headings is important* the immediate pairwise relations among entries always is 
irrelevant* 

In this chapter we concentrate on a detailed treatment of graphs, and to 
some extent charts, for a straightforward reason: grap^is are the most general 
form which at the same time very is constrained* That is, there are numerous 
different types of graphs — line, bar, surface, divided bar, pictograph — and yet 
the way they function to communicate information is well-structured* Although 
some display types, such as maps, are more constrained (the shapes must resem- 
ble those of the regions being represented), they are also less varied* We 
hoped thac by understanding charts and graphs we would develop a system rich 
enough to encompass all of the types of displays* In this case we would for 
the most part simply "relax" variour: stictures for making a good chart or graph 
when considering making a good map, diagram, or table* Thus, our approach in 
this book will be first to understand the most structured and demanding cases, 



15 



where graphs are used to communicate detailed information clearly and con- 
cisely* We then will turn to special cases, where only some subset of the 
complete information need be conveyed, and will consider variants on the 
standard graphic formats arv* varieties of other display types* 

This chapter has thret .iajor parts* We begin by outlining the foundations 
of our analytic scheme* Following this, we present the analytic scheme itself/ 
filling in more details about the basic ideas and how they were implemented (in 
particular, we introduce the "operating principles" here) * Finally/ we present 
two examples of how the scheme is actually used to analyze charts and graphs* 

I* THE ANALYTIC SCHEME 
A* TWO FOUNDATIONS 

The analytic scheme has two deep taproots* The first is the literature on 
how humans process visual input, and the second is the so-called theory of 
symbols* 

visual information processing 

A wide range of activities is interposed between that instant when you 
first fixate your gaze upon a visual display and that moment when you success- 
fully extract some given information from it* Hie explosion of interest in 
cognitive psychology in recent years has given us a general framework for 
talking about these activities and has given us a rich body of literature 
concerning their operation* An effective visual display must not require use 
. of mental operations people cannot perform, a nd must be easily dealt with using 
the operations we do not have at our disposal* Thus, it will behoove us to 
consider briefly now (but in more detail shortly) what is known about visual 
information processing, and then to consider how to use this information to 
diagnose bad displays a nd guide in the construction of good ones* 



Insert Figure 2*1 About Here 

Consider Figure 2* 1, which is a very simple schematic of three main types 
of visual processing* Ifte left most box represents "sensory information 
storage"* The information present in an aft '"nage is in this kind of storage. 
It is very brief (for only a few tenths of a * \;ond) and contains virtually 
unlimited information during that time* Ifte middle box is "short-term memory" 
(the word "memory" here is being used as in a computer's memory--a place where 
information is kept), The information stored here is usually accompanied by 
some conscious experience (such as of saying a word to oneself), and can be 
held in short-term memory by rehearsal (rote repetition)* Information only 
stays in short term memory for a few seconds unless actively rehearsed, and 
only a srsali amount of information (about 4 groups of items) can be held in 
this store at the same time* Short term memory is important here because it is 
the locus when conscious re -organisation and re-interpretation takes placer and 
its limitations severely affect what kinds of re-organisation and re-interpre- 
tation can take place (as will be discussed shortly)* Finally, the right-most 
box is "long-term memory"* Iftis memory stores a huge amount of information for 
an indefinite amount of time; your childhood memories/ your telephone number, 
and the name of your favorite book are all stored here, as well as your know- 
ledge of arithmetic and how various types of graphs (e*g*- line vs bar) serve 
to communicate information* 

In Figure 2*1 are schematised a number of properties of our visual infor- 
mation processing systems that affect reading charts and graphs (along with all 
other visual stimuli)* Four of these properties pertain to how information is 
transferred from sensory -in formation storage to short-term memory (and hence 
t into awareness)* First, if tie stimulus is too small or not contrasted enough 
witli a backgrounds you will simply fail to see it* The discriminability limits 
of the system must be respected if any further processing is goinj to happen* 

17 



Second, there are well-known systematic dis tortions in size and other proper- 
ties of objects* For example, if you estimate the relative areas of two 
circles, you are very likely to underestimate the size difference. Hiese dis- 
tortions are reasonably-well understood and can be avoided o*" compensated for 
in a display (as will be discussed in the following chapter! Hiird, some 
aspects of a stimulus are given priority over others; we pay attention first to 
abrupt changes of any sort (e.g., heavier marks, brighter colors). Fourth, 
stimuli are organized into coherent groups and units by the time we become 
aware of them. Much of this organization is "automatic", not under voluntary 
control, and is determined by reasonably-well understood properties of stimuli 
(e.g., proximity of elements) . The grouping imposed by these automatic pro- 
cesses must be respected if a chart or graph is to be seen the way a designer 
intends * 

Gi Tr en that information has been transferred from sensory-information 
storage, the next constraint we misz consider is the capacity limit of short- 
terra memory. If too much information must be held in mind at once, a person 
will be unable to perform a task. Thus, the complexity of a display will be a 
major factor in determining its comprehensibility. Once a display is in short- 
term memory, it is described . Tall bars, for example, are described as 
"large". A picture of a tall tree standing for a bar in a bar graph will be 
described both as a tree and a? large. The description assigned here on the 
basis of the appearance of a display must correspond to one stored in long-term 
memory if it is to be interpreted correctly. And the way a display will be 
interpreted depends on which stored information is most closely associated with 
the description assigned to the display, if a line is described as "steep" it 
vill be taken to represent a "sharp rise" in prices or whatever; if it is des- 
cribed as "shallow" it will be taken to represent a "slow rise" (even if it is 
the same information, jusc graphed on different-shaped axesl). 
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Finally, in long-term memory the major constraint is a person's knowledge * 
If a person does not know the meaning of a word, or of a pattern of lines form- 
ing the framework of display, he or she will have trouble associating the 
description of the display with the correct inte-oretat" In addition to 
general background knowledge, knowledge of the task at hand can have some 
important consequences: if the initial description of the display does not 
correspond to any stored information, knowledge of the task at hand can lead 
one to consciously re-organize the pattern, leading to new description and a 
new'attempt t3>J.nterpret the description against stored information* For 
example, if one sees a Star of David, one will organize that as two overlapping 
triangles* if asked whether there is a hexagon in the pattern, one will have 
to reorganize the pattern before seeing the hexagon in the middle* 

The foregoing activities are relevant whenever one is trying to interpret 
what one sees* The details of these activities have yet to be specified (in 
chapter 6 we present one theory) - but the basic kinds of operations seem clear 
enough* We certainly know enough about each operation and properties of the 
system to apply this knowledge to the designs of visual displays* Hie "Psycho- 
logical Maxim" is straightforward: Do not design a display that overtaxes the 
human infornation processing system* The analytic scheme we have developed is 
in part a systematic way of discovering whether a given display has violated 
this maxim* And if so, our scheme is designed to reveal exactly how a display 
offends our processing abilities and exactly which abilities have been compro- 
mised* 

Symbol systems 

The second foundation of our analytic scheme is the theory of symbols* 
Soms aspects of charts and graphs have nothing to do with the operation of the 
information processing system* They have to do with the very nature of how 
symbols operate* m the ideal case, a chart or graph will be absolutely unam- 



biguous, with its intended interpretation transparent upon the first glance* 
One way to think about this sort of unambiguity is in terms of mappinqs between 
symbols and concepts* If the graphic display is treated as a complex symbol, 
then we war 1 * a unique mapping between it and one's interpretation of it* 
Goodman has characterised systems that have the property of unique 

bidirectional mapping between a symbol and concept as being "notational*" 
These systems, such as musical notation, are much stronger than we need here* 
In them there is not only a single way of interpreting a given symbol, but 
there is only one symbol that can be used .for any given information* Our 
requirement here is less stringent: given a symbol, there would be only one 
way to interpret it* Thus, for present purposes, there are two important uses 
of the basic ideas underlying notational systems* 

First, we are concerned with the external mappings between the marks on a 
page and the interpretation of their meaning* It is important that the lines 
on the page be read as intended and have the intended effect on the reader* 
Second, there are internal mappings, which specify how marks in a chart or 
graph are paired with other marks; this is especially important when a key is 
used, indicating how labels should be paired with different lines* 

In Goodman's scheme, the first distinction of importance for present pur- 
poses is between a "nark" (also called an "inscription"), a "character class," 
and "compliance class*" A mark is a configuration of lines, such as "A"* A 
character class defines which groups of marks will be classed as equivalent, 
such as "A" and "a"* A compliance class is the referent, the semantic inter- 
pretation, of. the character class, such as "first letter of the alphabet*" 

The* distinction between a mark, character class and compliance class is 
useful in allowing us to contrast cases where marks do and do not map into a 
character class* If a physical mark maps directly into the compliance class, 
variations in the marks (such as weight Q f the lines used) are information 



conveying — which aeed aot be true if a mark merely signals a character class* 
The distinctioj^between marks that map into a character class and ones that map 
directly into a compliance class is the distinction between a sign , which is 
arbitrarily related to the thing represented (e.g., H C" could have been used as 
another mark for the character "A"), an<_ > depiction , in which mat j are non- 
arbitrarily related to the represented information* 

Kosslyn (1980) offers a set of formal criteria for distinguishing between 
marks that signify and marks that depict* Briefly, marks that depict have the 
following properties, none of which are necessarily shared by those that signi- 
fy* First, every portion of the mark is a mark of a portion of the referent* 
The symbol "*" depicts a particular snowflake if every arm, e*g*, HM ', corre- 
s ponds to a part of the snowflake itself* Second, the distance between all 
portions of the mark correspond to the distances between the corresponding 

portions of the object itself* Third, the lines used to inscribe a mark are 

* 

not arbitrary* That is, given the foregoing two criteria, as soon as H%n and 
H -" are ussd in inscribing the mark used to represent the size and posi- 

tion of the remaining lines of the mark representing the snowflake are 
determined* in contrast, any configuration of lines can be defined as an 
instance of a character class* 

Goodman offers five distinct formal requirements for a "notational sys- 
tem*" A notational system allows one to represent information precisely and 
unambiguously* English, then, is obviously not a notational system since am- 
biguous words or sentences are possible* Musical notation, however, meets the 
requirements of a formal notational system* Even though notational systems are 
stronger than we need for present purposes, it will behoove us first to consid- 
er Goodman's five requirements for a notational system here; following this, we 
will trim these requirements down to meet our present needs* Two of these 
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requirements are syntactic, concerning only the properties of marks and charac- 
ters, and the other three pertain to the semantic interpretation of the 
symbols * 

Tb' two syntactic properties are simply put* First, one should not be 
able to .nap a given mark into two different character clat Goodman calls 

this property "syntactic disjointness Second, one in principle should be 
able to decide into which character class a given mark falls* Goodman calls 
this property "syntactic finite differentation*" In other words, the first 
requirement states that marks must be unambiguously interpre table in principle, 



and the second Js tates that one should be able to tell one mark from another so 



that one can interpret a given mark* It is important for present purposes to 
note that the second requirement can be easily violated* Consider an example 
where lines of different lengths are used as marks and where any difference in 
length, no matter how tiny , affects the character class into which the mark is 
mapped* Wow, in thi<* case between any two marks an infinite nunber of others 
exist, and so *^o with any two characters* Given that no physical measuring 
instrument is infinitely precise, this kind of situation violates the require- 
ment of "syntactic finite differentiation," since one cannot decide precisely 
which character class a given mark signifies* In this case, the representa- 
tional system would be called" "syntactically dense*" An example of a syhtac- 
'tically differentiated system is a digital clock, w^ere every reading on the 
clock (i*e*, every mark) is distinctly identifiable and maps into one character 
class (and hence, the system is also syntactically disjoint)* An example of a 
syntactically dense system is a dial clock *vith no tick marks* Now every posi- 
tion of a hand is a different mark, which signifies a different— although not 
uniquely dtcidable — character (time)* This system is also syntactically dis- 
joint because no mark maps into "more than one character, although it is impos- 
sible to identify discrete marks* 




f*0 



The three semantic properties of a perfect "notational system" are con- 
cerned with the way in which one interprets the. meaning of marks; in Goodman's 
terms, they are concerned with the way in which characters are mapped into 
compliance classes* The first two properties parallel the syntactic ones dis- 
cussed above* First, two semantic categories (compliance classes) should not 
overlap so that they share members (as often happens in English) * In other 
words, this "semantic disjointness" property proscribes ambiguous marks* Sec- 
ond, jn a notational system one can identify the compliance class into which a 
given mark should be placed* Hiat is, the system has "semantic fi-oLte differ- 
entiation*" If one cannot decide which interpretation a mark should be given, 
the system is "semantically dense*" So, for example, a digital clock is seman- 
tically differentiated- because every reading has an identifiable meaning (and 
is semantically disjointed because each reading has only one interpretation) * 
A tire pressure gauge, in contrast, is semantically dense because every reading 
on the continuous scale has meaning but one cannot assign a precise meaning to 

any given reading (because between every two readings are an infinite number of 

p * 

other ones, precluding precise assessment of an individual reading)* However, 
if a tire pressure gauge is marked off in discrete intervals, and all readings 
within an interval have the same interpretation, now the system is semantically 
differentiated* Finally, the last semantic requirement is that all the marks 
of a given character class should have the same compliance class* Another way 
of putting this is tha** if marks* can be mapped into a character class, the 
semantic interpretation is terms of the character and not the marks 
directly* 

For present purposes, we have found it useful to streamline Goodman's 
scheme considerably* we are interested in identifying cases in which there is 
a failure to have an unambiguous mapping between marks and meanings in a chart 
or graph* in all cases, when such a problem has Ipeen identified it can be 
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ameliorated by changing the marks used in the chart or graph; even when a label 
is ambiguous, a new word or two can be substituted^ Iftus, we are not especial- 
ly interested in pinpointing a lack of: differentiation oc disjointness at the 
level of syntax or semantic. Given that a relevant co* *: and distinction 
exist in the readership po r jlation, we can merely be concerned with being sure 
that the external mappings from mark to meaning are in fact unambiguous by 
ascertaining that th*i marks are differentiated and the interpretation is dis- 
joint* For internal mappings, we will be concerned with part-for-part corres- 
pondences, which again requires dif ferent4tion and disjointness of the relevant 
parts* 



B. THE DESCRIPTIVE PRQCEOURE 

We have two broad classes of factors that must be considered when design- 
ing a chart or graph* A display must not overly tax our information processing 
abilities, and it must not be ambiguous or deficient in necessary information* 
We have designed a system for describing any given display that allows on© to 
diagnose problems — either psychological or formal — with the display* The sys- 
tem can only be applied easily to a perfect display* when there is any problem 
in using the system, this is like an alarm sounding, serving to alert one to a 
problem* The particular problem is revealed by where the system breaks down, 
and the way in which it breaks down* Ifte system has three components, the 
description proper, the diagnostics, and the evaluation* 
1* Generating a Description 

A description of a chart or graph is generated at three levels, and at 
each level the description is in terms of a set of components and relations 
among them, as described below* 

aj^ Syntax, Semantics, and Pragmatics 

We begin by describing charts and graphs with respect to three broad 
classes of properties* Ifte syntactic properties are those of the lines them- 
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selves; here the lin are not interpreted in terms of what they represent but 
are treated as entities in their own right* In this case, configurations of * 
lines are classified as falling into a set of "form classes," and the way these 
configu^, ns are organized together is specified* In our analysis, these 
form classes correspond to the major "basic lev x" constituents of charts and 
graphs, as will be described in the following section* The semantic properties 
are the direct meanings of the configurations of lines, what they depict or 
signify* The semantic analysis is the literal reajing of each of the compo- 
nents of a chart or graph and the literal, meaning that arises from the rela- 
tions among these components* Finally, the pragmatic properties characterize 
the ways in which meaningful symbols convey information above and beyond the 
direct semantic interpretation of the symbols* At the level of pragmatics in 
language, for example, the question "Can you open the door?" is not really 
comprehended as a question; rather, it is a request bo open the door* The 
conveyed meaning in this case is quite different from the literal semantic 
interpretation; pragmatic overtones of "visual displays hinge on the particular 
description assigned to the visual properties of the display (e*g*, "steep" vs* 
"shallow" lines are seen as different, even if the same data is presented) * 
b) "Basic Level" Graphic Constituents 

We distinguish among four "basic level" constituents of a chart or graph* 
Our notion of a "basic level" is directly analogous to how Rosch (1978) con- 
ceives of the notion cf a "basic level" in categorization hierarchies* In 
categorization, the basic level is the one that is as general as possible while 
still having as similar members as possible* For example, "apple," and not 
"fruit" or "Delicious apple", is the basic levol in< the hierarchy of inclusive- 
ness defined by those names, because that category 'captures the most exemplars 
that ^.re still very similar; going down the hierarchy results in fewer 
exemplars in the category, say Delicious apples, whereas going up the 
hierarchy, to fruit, results in the exemplars not being very similar to 



each other* Similarly, our basic level graphic constituents seem to be the 
most general way of classifying the components of a chart or graph that still 
have a high decree of similarity among the different instances of the class* 
However, in our case the similarity is not in appearance, but in function, \n 
the role a constituent plays in how information is represented . ■* a displa^ , 
The four constituents we use are called the framework , the background s the 
specifier , and the labels * These constituents are defined at the level of 
semantics, In terms of the information directly conveyed* Figure 2*2 serves to 
illustrate these basic level constituents for a typical chare and graph* 

Insert Figure 2*2 Here 
The framework * The framework "sets the stage" whereby the specifier 
material can specify the particular information being conveyed* The framework 
represents the kinds of entities being related (e*g*, year and oil production), 
but does not specify the particular information about them conveyed by the 
display (e*g*, the amount of oil per year)* The framework often has two parts, 
defined partly at the level of syntax: The outer framework extends to the 
edges of the display and serves the role ju^t described; the inner framework is 
nested within the outer one an/i often intersects elements of the specifier* 
The inner framework (often a grid or regular pattern of lines) usually func- 
tions simply to map points on the outer framework to points on the specifier* 
In most cases, the framework serves to organize the display into a meaningful 
whole at the level of syntax* In some charts, however, this is riot true (e*g*, 
see Figure 2*2), although the framework still functions semantically as 
described above. 

The background * It is important to distinguish the framework from the 
background of a chart or graph* The background serves.no essential role in 
communicating the particular information conveyed by a chart or graph* If any 
given background were removed, the chart or graph would still convey the same 
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information at the level of semantics* Although any given background is not a 



necessary part of a chart or graph (often the background is blank), occasion- 
ally a patterned background, such as a photograph, can serve to reinforce the 
information in a chart or graph at the level of **agmatics (e*g*, dead soldiers 
in a graph ^ijtrtit the horrors of war); a pattern*^ background can also interfere 
with one's ability to read a display, as will be discussed in detail shortly* 

The specifier * The specifier conveys the particular information about the 
entities represented by the framework* The specifier usually serves to map 
elements of a framework (actually present or inferred by the reader) to other 
elements of the fratrework* in graphs, the specified is often a line (serving 
to represent a function) or bars which pair values on the j>c and £ axes speci- 
fied by the framework* in charts, the specifier material is often directed 
arrows connecting two boxes or nodes* 

The labels * The labels are alpha, numeric or depictive (i*e*, pictures) 
and provide an interpretation for another line or part thereof (which is a 
component of either the framework or the specifier) * 

In addition to describing these constituents in terms of their syntactic/ 
semantic, and pragmatic characteristics, we also describe the interrelations 
among the constituents* Much semantic information, for example, arises from 
the ways in which the components are physically juxtaposed* In addition to 
simply assigning lin£s to one of the three basic level graphic constituent 
classes, we also describe the constituents in terms of their subcomponents (for 
example, the framework of the graph illustrated in Figure 2*2 is composed of 
two lines that are organized such that one i s horizontal and one is vertical 
and they meet at the lower left side oi the horizontal line) * The subcompo- 
nents are described in terms of simple "Gestalt wholes" (such as line segments) 
and thr relations among them* 'At one time, we considered introducing a set of 



primitive elements" and relations which would provide a fixed "alphabet of 





shapes 11 to be used in all our analyses* This proved very difficult to do, 
however, and proved to be totally unnecessary for our purposes* The wide vari- 

^ ety of charts and graphs seems to preclude specification of a reasonably small 

set of discrete elements from whi^h all charts and graphs can be constructed, 
but even if this were possible, ti ^ important variations seem to occur at what 

0 we have dubbed the "basic level" of organization into the graphic constituents 

noted above* 
2* The Diagnostics 

^ If a chart or graph is unambiguous and easily read, one should be able to 

assign an unambiguous description to it* Whenever one has difficulty in des- 
cribing it, however, this is an indication that the display is flawed* At this 

^ point one tries to categorize the flaw using two classes of diagnostics* 

a) Operating Principles 

Many of the problems with a display can be linked to violations of princi- 
^ ■ es that describe the operation of the human visual system* These violations 

can occur at each of three levels of description* 

Syntactic Principles * These principles describe constraints on how lines 
^ may be interpreted and organized* A syntactic problem is not tied to the lines 

having a specific meaning, but hinges on problems with extracting any meaning 
from lines* If these principles are violated, one either cannot read a chart 
^ or graph (without, perhaps, the aid ot a magnifying glass and ruler), will 

systematically distort information when\ reading it, will tend to have diffi- 
culty organizing it correctly, or will find it difficult to hold the number of 
relevant lines in mind at once* These principles summarize what we know about 
how information is transfered into, and retained in, short-term memory* 

Semantic Principles * These principles describe constraints on the w**ys 
patterns of marks are interpreted* A semantic principle is tied to how a 
specific meaning can be extracted from a configuration of marks* These princi- 
ples were derived primarily through a review of the literature on how people 
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will spontaneously describe a visual display, and on the kinds of concepts 
people must 1 have to understand charts and graphs* If these principles are 
violated, people may become confused in interpreting the meaning of a chart or 
graph* 

Pragmatic Principles * 1fte pragmatic prii. L^les describe t ways people 
in our culture customarily import more meaning than is actually conveyed on the 
page or the ways in which context interacts with display comprehension* These 
principles were derived through an analysis of a set of charts and graphs/ as 
will be described later in the bo6k* 

b) Formal Principles * 

Hiese principles are special-purpose formulations of those underlying 
Goodman's concept of a notational system* They describe aspects of charts and 
graphs that must be respected if the chart or graph is to be unambiguous* 
3* The Evaluation 

Ifte final basic idea of our analytic scheme is that charts and graphs are 
created With a specific purpose in mind; they are intended to all ow a reader 
to answer certain questions and not others* Thus, although an operating prin- 
ciple may be violated the chart or graph may not be impaired — it may still be 
able to serve its purpose adequately* For example, the graph illustrated in 
Figure 2*2 violated what we Will call the Principle of External Mapping (a 
formal principle) because the points on the function do not correspond 
unambiguously to points on the axes* But this is not an impairment in the 
graph/ given its purpose* In fact, wh'en graphs ar*a used as idealizations to 
present a general principle, the additional information necessary to totally 
disambiguate the display may distract from the purpose (see our principles of 
processing priorities and limitations, to be discussed shortly)* Thus, 
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although our scheme faithfully exposes every little detail that violates an 
operating principle, not all of these violations may be important* Whether a 
violation of a principle renders a display ineffective depends on the purpose 
to which the chart or graph will be put* The scheme errs on the <*ide of being 
too conservative, leaving it up to the human user to discount pa' ocular viola- 
tions as he or she sees fit* This was the only real option, given that all 
other alternatives run the risk of not exposing potential problems with the 
chart or graph* Later in the book we will present a detailed theory of how 
people actually comprehend visual displays which will then guide us in applying 
the principles themselves* Before developing and using such a theory of. 
information-processing, however, it will behoove us to explore the usefulness 
of the general approach being taken here* 



II* USING THE ANALYTIC SCHEME 
Our scheme produces a description of any given chart or graph at three 
distinct levels of analysis, the syntactic, semantic, and the pragmatic* The 
description revolves around characterizing the basic level constituents noted 
above, namely the framework, background (if present), specifier and labels, as 
well as the relations among them* In the course of discussing how the scheme 
assigns descriptions to charts and graphs, we will introduce the operating 
principles* The?e principles will be described only briefly here; in later 
chapters we will flesh out the details of each principle* This exercise will 
provide detailed guidelines for evaluating displays and also will provide 
requirements on a theory of how people process visual displays, which will be 
presented subsequently* Following this, the theory will then be used in con- 
junction with the descriptive scheme to provide guidelines for a design of a 
chart or graph intended to make a particular point* 



30 



The following is a description of how the scheme is applied to a single 
chart or graph* At the outset, however, we ask whether the chart or graph is 
in fact composed of a number of subcharts and graphs* That is, we ask whether 
there is more than one chart or graph present and whether there are systematic 
relations among the information in each* If so, the scheme is ied to each 
one separately and then to the set of charts and graphs together* An example 
of an analysis of a complex multipanneled graphic display will be presented in 
the final section of this chapter* 

In each of the levels of analysis, we ask a number of questions that 
should be easily answered if the graph is "unambiguous * If we have trouble 
arriving at a straightforward answer to any of these questions, this alerts us 
that one or more of our operating principles has been violated* We then simply 
consider each principle relevant to that level of analysis, checking to see if 
it has been violate*?* Thus, because the system is set up to reveal violations 
of these principles, it will behoove us to begin each section with a brief 
overview of the relevant principles themselves* Following this, we will con- 
sider the actual mechanics of generating a description of a chart or graph* 

THE SYNTACTIC ANALYSIS 

Operating Principles 

We posit three broad classes of operating principles at the syntactic 
level that cannot be violated if a chart or graph is *to be effective* Each of 
these classes contains a number of specific principles which themselves have 
specific aspects, as will be developed in detail in the following two chapters* 
A* Principles pertinent to seeing the lines 

The visual system imposes numerous constraints on how marks can be used to 
convey information in charts and graphs* The first set of principles bear on 
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how lines, colors, and regions are accurately discriminated--which is a neces- 
sary prerequisite for further processing* We posit two principles that bear on 
the process of discriminating marks: 

1 * The prxnciples of adequate d iscriminability 

Variations in marks must be 9, .at enough to be easily noticed* These 
principles have two aspects^ 

a) Relative discriminability: Two or more marks must differ by a minimal 
proportion to be discriminated* Hie laws governing the size of this difference 
have been worked out for many types of marjcs and these laws comprise this prin- 
ciple, as is described in the following chapter* 

b) Absolute discriminability: A minimal magnitude of a mark is necessary 

for it to be detected* This '•absolute threshold" has been computed for many 

types of marks, as is described in the following chapter* 

2. The principle of perceptual distor tion 
\ 

The visual system often systematically distorts the magnitude of marks 
along various dimensions (such as area and intensity)* This distortion is 
described by the value of an exponent in*a formula developed by S*S* Stevens 
and his co-workers, as is discussed in the following chapter* Marks can be 
intentionally- altered to compensate for the distorting properties of the visual 
system (which, for example, make increases in area seem smaller than they are)* 
B* Principles pertinent to organizing marks into units 

Harks are rarely seen as isolated dots on a page* Rather, individual 
marks usually are organized into perceptual units, such as occurs when a series 

of marks like rt — — — " are seen as forming a single line, not as a series 

of isolated dashes* A set of principles describes the main factors that deter- 
mine which marks will be grouped together into a single perceptual unit* If 
these principles operate to group together elements of a display inappropriate 
ly, the display must be changed* 



t. The GestaXt principles of organization 

The Gestalt'psychologists, who had their heyday during the 1930's, discov- 
ered almost 120 distinct laws that dictated how forms were organized* The more 
important laws (for present purposes) can be summarized by four r ^eral princi- 
ples: 

a) Good continuity: Marks thau suggest a continuous line will tend to be 

grouped together. So, " " is seen as comprising a single unit, not 10 

separate ones. 

b) Proxinity: Marks near each other will tend to be grouped together. 
So, H xxx xxx" Is seen as two units whereas "xx xx xx" is seen as three. 

c) Similarity: Similar marks will tend to be grouped together. So, 
"XXX@@@" is seen as two units. 

d) Good form: Regular enclosed shapes will be seen as single units. So, 
"U H is seen as a unit whereas tt i- n is not. 

2* Principles of dimensional structure 

Marks vary along a number of dimensions, such as h£e, size, height, and so 
on. Some of these dimensions cannot be processed independently of others. For 
example, it is impossible to see the hue of a mark (i.e., its shade of color, 
roughly) without seeing its saturation (i*e., the richness of the color, rough- 
ly). Thus, some dimensions are organized into single units whereas others 
(such as hue and height) are not. Hie dimensions that are "stuck together 1 ' in 
processing are called i ntegral dimensions and the ones that are processed inde-* 
pendently are called separable dimensions. 
C. Principles of processing priorities and limitations 

The visual processing system has quantitative and qualitative limitations. 

> 

Partly because only a limited amount of information can be held in mind at 
once, som*> marks will be given priority over others. Hie information conveyed 
by these marks should be central to the display's message* Further, some kinds 
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of comparisons are difficult for the visual system to perform, and hence a 
display should not require use of then* Ihese facts are the basis for two 
kinds of principles: 

1 * Principles of processing priorities 

Some colors, weights of line, and sizes are noticed before <_ >iers» For 
the most part we do not have formal rules for determining which these are, but 
instead rely primarily on a general principle; the visual system is "a differ- 
ence detector"* Any sharp contrast will draw attention* In addition, some 
stimulus properties have been determined empiriclly to be "salient" (e*g*, all 
other things being equal, a yellowish-orange is noticed before a deep blue). 
Physical dimensions of mar)^ should be used to emphasize the message, not to 
distract from it (e*g*, by making the background too prominent)* 

2* Principles of processing limitations 

These principles fall into two categories: 

aj Finite capacity: Only about 7 units can be seen at a single glance, 
and only about 4 can be held in mind a-t once* Graphic displays should not 
contain any unit (e*g*, group of lines) which itself contains more than 4-7 
subunits le *g» , lines) * 

b) Unit binding: It is more difficult to see and compare parts of percep- 
tual units than it is to see and compare entire units. For example, w - w is 
more difficult to compare to the lower left leg of "x" (not a natural unit) 
than to V" natural unit)* Graphic displays should not require readers to 
decompose natural units in order to extract specific information, as occurs if 
single points along a time line must be interpreted* 
Applying the Analytic scheme 

The main point of describing a chart or graph using our scheme is to re- 
veal violations of the operating principles that impair the effectiveness of 
the chart or graph* i n order to do this, however, one must generate a descrip- 
tion of exactly what is out there, exactly how a chart or graph is composed* 
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Thus, our scheme requires one to engage in two distinct activities* First/ one 
actually describes the chart or graph* This is especially the case at the 
syntactic level* Second, one asks questions about the description, checking to 
ensure that the description is unambiguous and transparent* If not, one <* - 
more of the principles has been violated* The level qf detail of the descrip- 
tion proper is motivated by the kind of information one will need later on to 
assign a semantic interpretation, and then the pragmatics, of the chart or 
graph — again with an eye toward discovering violations of the respective types 
of operating principles* 

We begin by isolating the four basic-level constituents and then asking 
the following questions about them; 

The Background 

We first ask whether there is a background and, if so, we describe it* A 
background extends beyond the framework and does not actually help to cdnvey 
the information in the display; removing the background would not impair how 
the chart or graph functions to represent information* Some backgrounds, how- 
ever, can consist of patterns that make it difficult to detect the pictoral 
material or other lines (and hence, violate the principle of adequate discrim- 
inability) , Other potential problems with background information will be dis- 
cussed later* 

The Framework 

Next we examine the outer framework* We define the outer framework as the 
set of lines that serve to define the general entities that are addressed in 
the display* 

What are the elements? Are they lines? If so, of what shape, 

weight, and color? Are they clearly disce^nable? 

If lines function as a^es, are they dense or differentiated? 
We note whether the framework and its individual component parts are 
easily identified* 
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How are the elements organized? Are the relations among the 
different parts clear? Does the organization violate any of the 
natural organisational principles? How many elements must be held in 
mind at onc<* in order to organize them into the entire framework? 
Occasional. 1 , a chart or graph will also include an inner framework/ such 
as lines that cross-hatch the interior of a chart* If there is an inner frame- 
wcyrk, the same questions noted above are asked of it* 

Next, we consider the organization of the two frameworks/ if both types 
are present* In particular, we ask how the similarity, proximity/ and continu- 
ity of framework elements imply organization* Following this/ we ask a number 
of general questions about the entire framework: 

Does the framework represent 2D or 3D space? Are quantities 
distorted because of an ambiguity here? Is color employed in the 
framework; if so/ what is emphasized? If line weights are varied/ 
what is emphasized? (This will be important later in our pragmatic 
analysis)* what is the aspect of-the axis? (That is, which axis is 
longer; this also will be important in the pragmatic analysis)* 
The specifier 

We begin by isolating the class of visual continua used to represent in- 
fo -motion* then describe how shape, size variations, color and texture are 
used* In a typical chart or graph/ such as that illustrated in Figure 2*2/ the 
line serving as a function cannot properly be described as being syntactically 
or semantically differentiated* Thus, this would seem to preclude the graph 
being unambiguous; recall that one of the properties of notation systems is 
differentiation, ensuring unique mapping from mark to compliance class* How- 
ever, one must take two factors into account here:£ First, what is the intended 
use of the chart or graph? Tor many purposes only a rough approximation is " 
desired/ especially when graphs are idealizations (such as Figure 2*2) t 



intended to illustrate some general point* Second, even when precise informa- 
tion is being conveyed, one is in fact workiru, with "psychological units" of 1 
limited precision; our perceptual apparati simply cannot make discriminations 
beyond a certain limit* Thus, if the smallest discriminable segments o* a line 
used as a function map unambiguously onto b j tallest discriminable ^ ^ments 
of the axes, the chart or graph can function notationally* Thus, we go on to 
ask; 

What are the elements used to compose the specifier? Is it 
clear whether parts are overlapping or contiguous? Are there too 
many elements to keep in rind at once? Are variations used to convey 
information clearly distinguishable? 

How are the elements organized? Is the organization clear? If 
the specifier does not clearly imply a 2D shape, does an ambiguity in 
the dimensionality preclude easy reading of the information? - 
Labels 

We first consider three kinds of labels independently, and then turn to an 

* 

analysis of the relations among the labels* We pay special attention to the 
title, asking first if there is one* IZ sot 

Is the title clearly discriminable as a title? 
What is the* relation of the title to other elements of the chart 
or graph? Does it naturally tend to be organized such that it incor- 
rectly appears to label only a local part of the chart or graph? 
Next, we consider whether there is a remote legenfi or key* If so, we aski 
Is the information clearly readable? 

Does the legend clearly separate *itself from other elements of 
the chart or graph? 

Is there too much material to be easily helc^in memory? 

/ 



37 

4u 



Depending on the type of labels used in the title and legend or key, 
the following questions are then asked about them (as well as about all 
other labels of each type) * 

Alphabetic labels : Are alphabetic labels present? if <*o; 
Are they clearly readable? 
How many are present? 

What size of typeface is used for each of the labels? Note if 
different si^es are used for different labels (this may be important 
at the pragmatic level) * 

How do labels group together? Is the natural grouping congruent 
with the intended interpretation? 

Numeric labels : Are digits used as labels? If so, ask of them the 
same questipn^ asked of the alpha labels* 

Depictive labels : Are pictures used as labels? If so: 
Are they clearly identifiable? 
How many are present? 

Are they all the same size? (note differences) 

How do these labels group together? is the natural grouping 
congruent with the intended interpretation? 

If color variation is an important component in the labels, are 
variations clearly discriminable? 
Organization among the different types of labels 

How are the labels organized? Do any natural organization prin- 
ciples result in an incorrect organization of the labels? (for exam- 
ple, does dissimilar typeface cause one to separate labels that 
should be grouped together? Does proximity of labels cause one to 
group them improper ly? Are labels ordered in such a way that yoi" 
group them improperly?) 
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Organization among framework and specifier 

What is the relationship between the framework and specifier? 
Is the specifier completely contained within the framework? 

Are lines of the inner framework confusible with the specif* r? 
Do natural organizational principles cause one to group the 
framework and specifier incorrectly? 

If the dimensionality of the space is not 2D, is it consistent 
between the framework and specifier? 
. Organization among framework and labels 
The organization between the framework and each type of label is 
considered separately* with the following information being provided (as 
appropriate) : 

How are the labels associated with the framework and parts 
thereof? Are value markings indicated along the framework? If so, 
do the labels clearly indicate the correct values corresponding to 
the associated portion of the framework? 

Do any natural organization principles result in an incorrect 
organization of the framework and labels? 
Organization among labels and specifier 

How are the labels "and specifier associated? Is all specifier 
labeled? If the label is remote* in a key, is the mapping from ele- 
ments in the key to the specifier clear? 

Do any natural organization principles result in an incorrect 
organization of the labels and specifier? 
Organization agong labels, framework, and specifier 

Is too much material present to apprehend all at once? 

Is too much material in too small an area? 

Do natural organizational principles impair discerning the 
incorrect relations among the constituents? 
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THE SEMANTIC ANALYSIS 



In considering the semantic content of a chart or graph, let us begin by 
briefly outlining the four operating principles we have posited, and then turn 
to our scheme for describii the semantic information in charts and graphs* It 
is at this level that the differences between some classes of charts and graphs 
as such become important, requiring us to develop two different sorts of seman- 
tic interpretations, one based on qualitative relations and the other base^ oh 
quantitative relations* 

Operating Principles ^ 

We have posited two classes of semantic principles, both of which are 
supported not only by ample findings in the psychological literature, but by 
new data we have collected (examples of the problems that arise when the prin- 
ciples are violated will be illustrated in chapter 4)* These principles are 
concerned with the kind of description that will be assigned to a display' and 
how it will be interpreted* 

A* Principles cf surface compatibility 

The mark used to symbolize or depict an object or class must be appropri- 
ate for that role. Some marks inherently look like something other than what 
they are intended to represent, which impairs correct interpretation of them* 
This principle has three aspects; 

1* Principle of representativeness 

All marks have a preferred interpretation* Tfre intended meaning of a mark 
should not conflict with the spontaneous interpretation cf it* Thus, labels 
should name words that are indicative of the class (including the correct con- 
notations) and pictures should depict appropriate objects (a picture of a 
penguin-like bird should not be used to. label birds in general)* In short, 
a label or picture should be of a representative or typical example of a class 
or of the class directly* 
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This principle has four aspects; 

a) Description conflict: The description of the lines themselves should 
be compatible with their meanings* For example/ for words printed in different 
colored inks, people ave trouble reporting the color of the ink if the words 
themselves name different colors (e.g*, the word "red" is printed in blue ink; 
this is known as the "Stroop effect")* Thus,' larger symbols (described as 
larger) should represent larger qunatities, faster rising lines should repre- 
sent sharper increases/ larger typeface should correspond to larger objects/ 
and so on* 

b) Aligning Dimensions; The "more" and "less" poles of a dimension used 
in a graph should correspond to the "more" and "less" poles of the variable it 
represents, respectively* Thus light patches should represent smaller quanti- 
ties, and dark patches greater quantities, rather than vice versa; similarly/ 
marks that are high, tall, wide* long/ saturated, filled, dense/ or sharp 
should represent larger rather than smaller quantities* If in doubt/ say the 
words for the two poles in each order; the pole that is first in the better 
sounding order is the "tnoxe" pole (e*g*/ long and short sounds better than 
short and long / thus long is the "mora" pole* 

c) H^rkedness: Some words name not only a pole of a dimension but the 
dimension itself* We say "how high is that?" without implying necessarily that 
it is high/ but if we say "how low is that?" we imply it is low* The term that 
implies a specific value is called the marked term, and should not be used to 
label the dimension itself — if it is, it will mislead the reader* Similarly, 
one should use the unmarked member of a pair of comparative terms; "larger" is 
better than ''smaller", and so on* 

d) Prj^iples of cultu al conventio n: Ifte conventions of a reader's cul- 
ture should bo obeyed when drawing an effective graphic display* So/ for exam- 
ple, the color red should not be used to represent "safe" areas, and green 
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should not be uced to signify "danger*" Similarly/ time should increase going 
left to right or bottom to top* 

B* principles of schema availability 

In order for a chart or graph to comprehensibl' - -~» reader must have the 
requisite concepts* That is, a "complin *ce class" is in fact something in a 
reader's head* The reader must know both the individual concepts and the 
general idea of how a particular graphic design conveys information* 

7* principle of concept availability 

A chart or graph should not make use of concepts that are not likely to be 
possessed by the intended readership* 

2* Principle of graph schema availability 

Information should not be presented in a graph type that is unfamiliar to 
a given readership or that taxes the information-processing abilities of the 
readership population* 
The Formal Principles 

In the course of describing the semantic interpretation of the syntax of a 
graphic display we are faced with describing how the marks map into semantic 
classes* Hius, it is at the point of formulating the semantic description that 
it is most convenient to begin to consider our two general mapping principles/ 
derived from the requirements of notational systems (streamlined for present 
purposes) * These principles deal with external/ "vertical 1 * mappings between 
levels/ and internal, "horizontal" mappings between elements at the same level 
of description, and thus will sometimes be involved in the syntactic analysis 
per se* 

The vertical mapping principle* Every meaningful 1 difference in the value 
of a variable should be represented by detectable differences in marks/ and 
every mark should have one and only one meaning* Ambiguous or missing narks 
violate this principle and require an alteration at the level of syntax* 



42 



ERIC 



The horizontal mapping principle * Portions of the chart or graph that are 
meant to correspond to other portions of the chart or graph should do so in an 
unambiguous way* The key, for example, should clearly indicate how labels are 
paired with different components of* the specifier* This is true both at the 
level of the ma ks and at the level of the meaning, of the marks (most notably 
labels)* This principle-W^distinguished from the natural organizational prin- 
ciple in the following way; when a natural organizational principle has been 
violated, the violation can be corrected by rearranging marks already in the 
display (by repositioning lines and the like) * When the horizontal mapping 
•principle has been violated, new marks must be added (e*g*, lines or arrows 
connecting parts) * A necessary ingredient is missing when the mapping prin- 
ciple is violated* 
Applying the Analytic Scheme 

As in our treatment of the syntax of charts and graphs, we decompose the 
problem of describing the semantic content (the literal meaning) of a graphic 
display into four parts; characterizing the background, the framework, the 
specifier ^nd the labels* As before, when describing the chart or graph, we 
are looking for violations of the operating principles that come to light when 
the display is being analyzed* 

Background 

If the background is patterned, the meaning of the pattern should be con- 
sistent with the information presented in the chart or graph* If background 
figures are present, do they distract from the meaning of the chart or graph? 
Are the elements of the background ambiguous? Is it clear whether elements are 
contiguous or overlapping? Do parts of the background occlude parts of the 
framework such that information is lost? 

Frarowork 

The most important feature of the framework is that it serves to allow the 
reador to extract the meaning of the marks and their organization* The ele- 
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ments of the framework should serve these ends* we begin by asking whether 
meanings of the elements are unambiguous* we note whether any part is not 
present or not implied* Next* we consider whether the syntactic properties of 
the elements engender correct mapping into a compliance class* ^us* we assess 
the scale type used in a graph and note whether the semantic sea ^ clearly 
indicted syntactically* For instance, if the scale useu on the axes of a graph 
is syntactically dense, the semantics — the actual scale being represented" 
should also be semantically dense (e*g*, a ratio scale should not be used in 
making the axes to represent an ordinal scale) * In the same vein, the labels 
along the axes should be compatible with the actual scale being used and with 
the markings along the axes; the numbers spaced along the axis should suggest 
the correct scale type* Many of the problems with frameworks, as the reader 
probably inferred from the foregoing concerns, are violations of the formal 
mapping principles* The principles of surface compatibility also are sometimes 
violated here* Thus, we also ask whether variations in size, color, and the 
like are compatible with what is being represented (color changes from red to 
blue should not indicate rising temperature, for example)* 

We next note the extent of the scale, attending to not only its range, bu*, 
the baseline* This may prove important in the subsequent analysis of the prag- 
matics of the chart or graph* 

In addition to the foregoing questions, we check whether the lines that 
compose the framework depict some object* (This is quite common in many popu- 
lar magazines)* Thus, we &sk; 

If the framework is serving to depict some object or scene, what 
is the meaning? Is the meaning clearly evident, and is the depicted 
object clearly representative of the class of objects being depic- 
ted? 



44 



The specifier 

The meaning of the specifier is derived from how it relates parts of the 
framework together* Thus, in large part we will defer discussing the meaning 
of the specifier until considering the relationships among the different con- 
stituents* However, we c \ ask two things about the meaning of the specifier 
marks jper se* First, they should be concise, no more or less being present 
than is needed to convey the information* If too little is present, the verti- 
cal mapping principle will be violated; if too much, it may be unclear what is 
being conveyed* (Note: if one wants a decorative piece of art accompanying an 
essay, however, this will be a violation only if the illustrations and fancy 
extraneous interfere with comprehension of the actual content*) Second, Speci- 
fiers often are depictions (e*g*, a graph of rising prices could have a jet 
plane taking off, with its exhaust being the function)* if so, we ask; 

Are the depictions clearly representative of the compliance 
class in question? One would not want a picture of a potato to 
stand for "plant life," for example (since potatoes are hardly typical 
--in Rosch's (1978) sense — plants j* 

In addition, one wants to ensure that marks used to represent 
different things look more different than marks used to represent the 
same thing* Further, one whould check -that the literal' interpreta- 
tion of the marks is compatible with the role they play, as noted in 
our principles of surface compatibility* 
Labe Is 

For each type of label, we begin by considering whether the marks used as 
labels are compatible with the represented concept and whether the meaning of 
each label is accessible to the intended reader* Following thi^, more particu- 
lar questions are asked of each of the three types of labels; 

Alphabetic labels t Are the words ambiguous? Are the meanings of all 

the wocrf*; representative of the class being indicated? 
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Numeric labels : Are the units clear? Are the units familiar? - 
Depictive labels : Are pictures used as labels easily identified; are 
they familiar to the intended readers? Are the (narks used to depict 
clearly representative of the incept that they stand for? 
Pair-wise combinations of labels Cases where labels are serving to iden- 
tify other labels (e.g., naming a picture) are also considered vis-a-vis 
our principles* 

Organization of basic level constituents 

Following analysis of each of the individual constituents, we again turn 
to an analysis of the organization of the constitueats of the chart or graph* 
The way charts and graphs are organised is considerably ©ore complex at the 
level of semantics than at the level of syntax, which also seems to be the case 
in language* We have devised two general kinds of rules of combination that 
are critical for deriving all of the information represented by marks in a 
graphic display* one kind of rule is appropriate for graphs, where a quantita- 
tive relationship between two or more values on two or more scales is represen- 
ted; with two scales there are 10 possible combinations among the four scale 
types (nominal, ordinal, interval and ratio) that are commonly used* The other 
kind of semantic rules of combination is appropriate for charts, where a quali- 
tative structure or organization of entities is represented* Let us consider 
each kind of combinatorial scheme \n turn* 

Quantitative Relational Information * Perhaps the best way to present the 
formal properties of this aspect of graphic semantics is in tabular form* 
Thus, the following table relates values on tw scales to each other* We will 
consider all possible, combinations of nominal, ordinal, interval and ratio 
scales except the nominal-nominal relations (which fall in the second class of 
rules)* Recall that nominal scales are not ordered, with numbers being U3ed as 
names (as on football players 1 sweaters); ordinal scales are rank ordered 
according to quantity, but the actual maagitudes of differences irrelevant 
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(as in the first, second and third place winners of a race); interval scales 
are ordered so that the magnitudes of differences mean something/ but ratios of 
numbers do not (as in the Farenheit scale/ in which the point labelled "zero" 
is completely arbitrary); finally/ ratio scales have numbers that are ordered 
so that the magnitudes of diff, ^ujes are meaningful and r* can be computed 
(as in Kelvin degrees, where 10° is twice as hot as 5° — which is not true 
With Farenheit degrees)* in addition to providing an example for each in the 
table, we list examples of the kinds of information available in each case* 
Extensions to n-dimensional cases follow in a straightforward manner from the 
simple two dimensional cases considered here* 

INSERT TABLE 2*1 HERE 

The information content of a graph can then be assessed by interpreting 
the individual axes, noting how points are paired by the specifier(s) , and then 
and using the taxonomy in the table to derive the relationships between the 
values* If the relationship is not clear/ there is a failure of internal map- 
ping (the specifier is not clearly serving to pair points on the framework) or 
a failure of external mapping (part may be missing) * (Violations of many other 
principles can also distort the relationship/ depending on problems in seeing 
the specifier or organizing parts of it correctly*) 

Structural/organization 'information : A computer flowchart/ an organiza- 
tional chart for a government agency/ a nd a family tree do not relate values on 
dimensions* Rather/ they specify the relationships among discrete members of 
some set* This sort of information can be described using the following three 
general criteria* These criteria are independent of one another* . 

Hie first criterion is whether the links between entities are directed or 
nondirectad * Elements of the framework (i*e*/ marks indicating an individual 
member of the set) can be related together either by symmetrical or by asymmet- 
rical relations* For example/ in a kinship diagram/ the vertical links of the 
tree are directed/ indicating who is the parent of whom (an asymmetrical rela- 
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tion) * The horizontal links/ such as "sibling of H (a symmetrical relation), 
are nondirected* 

The second criterion is how many types of links are used* More than one 
kind of relation may be used in a gr^.ph* In a kinship diagram/ for example/ 
■■cousin of" and "brother of" may bo. . be present* In a computer flowchart/ 
only a single arrow — indicating which operation follows ano^^r — may occur 
following an operation* 

The third criterion concerns the type of mapping used* There are three 
classes of mappings: 

One:One, ManyrOne (or One:Many) and Many:Many mappings/ which we will 
consider in turn: One:One mapp_..„ * In this case links in a ch\rt might indi~ 
cate how husband and wife pairings occur by drawing lines connecting points 
representing the location of each individual at a cocktail, party* 

Many:One or One:Hany mappings: In this case* it if* /important to consider 
separately directed and nondirrsted links^ With directed links/ inclusion 
relations may be indicated by a M>:*r2y?0nfs Knapping such as occurs in a hierarchy 
where many objects are orgari2e<? videt a superset* with nondirected links/ 
collateral relations are £>Kfxcated* If all diplomatic relations were symmetri- 
cal, links on a m .p illustrating the diplomatic relations of any one country 
would represent this sort of mapping* 

ManyrMany mappings: In this case/ the multiple affiliations of a number 
of different objects can be represented* For example, a chart might represent 
different social classes by a drawing of a typical member of each/ and might 
represent different social institutions by drawings of typical buildings (e*g*/ 
a church or a bank) * Lines could connect the people to the institutions to 
which at least a majority of the represented class belong* 

In charts, then/ the nature of the mapping must be clearly indicated by 
the specifiers* Too many arrows can obscure mappings among elements, as some- 
times happens in tangled organizational charts* Directionality and specific 
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meaning (achieved via labels) may be important/ and clearly defined links are 
always important* In actually describing a chart or graph/ we are careful to 
consider what kind of information is being conveyed (hierarchial/ relational, 

^ etc*)* We then consider whether the marks effectively convey the meanings of 

the relations among the marks as the graph maker intended* 

In the course of describing the overall organization among the constitu- 

^ ents, we take special care on the following points; 

Organization among the framework and labels * We consider how labels serve 
to interpret different aspects of the framework* Each label type is examined 
separately* 

Organisation among the labels and specifier* We consider how labels serve 
to interpret different aspects of the specifier* Each label type is examined 
separately* 

Organization among the labels, framework, and specifier* Finally, we 
examine the overall configuration of the display, investigating whether graphic 

^ relations among depictions convey the intended meaning* We ask whether all 

associations among adjacent or overlapping material are clear, if the graph is 
not intact (perhaps because adjacent material on the page occludes part of it), 

^ and if it is difficult to read* 

THE PRAGMATIC ANALYSIS 

» 

As in language, not all the information humans gather from charts and 
graphs i s dictated by the literal interpretation of the marks oh the page* If 

ft 

w the number of war dead were indicated in a bar graph by increasingly higher 

piles of bodies of dead children, to take a grisly example, the reader would 
probably not simplv register the literal information conveyed by the height of 

ft 

w the column* Similarly, if one bar in a bar graph were printed in bright orancje 

ink, and two others in dull gray, that bar would be hard to ignore* This 
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"pragmatic" aspect of communication with charts and graphs has been discussed 
at some length by Huff (1954) in his classic book, How to Lie with Statistics * 
The operating principles offerred here were determined primarily by considering 
the kind and order of the description of the lines a person would build up, 
with >*agraatic "connotations" arising from these descriptions* The principles 
were then tested by constructing demonstrations in which visual properties were 
manipulated to produce descriptions at the semantic level which emphasize some 
parts of the information at the expense of others* often to the point of being 
misleading* 

Some of these principles have rather direct correspondences to similar 
principles underlying language (see Grice, 1967)* 
Operating Principles 

Two classes of principles capture the relevant pragmatic uses of charts 
and graphs* The classes contain numerous individual principles/ however, and 
thus we shall defer discussing them until chapter 4* The classes are: 

A* Principles of invited inference 

Although a chart or graph may not mislead on the semantic level, it may 
invite us to misread it anyway* Thi*: ^ ^ done in numerous ways; truncating 
scales so that small proportional differences appear larger; varying the type 
of scale used (linear vs* logarithmic, for example); using inferred 3-D proper- 
ties of a display so* that we see things as bigger than they are/ and so on* 
Some of these principles are directly reflected by Huff's 0954) advice about 
how to lie with statistics* 

B* Principles of contextual compatibility 
,-;*>Most graphic displays are embedded in a context, either in text or in an 
oral presentation* The context and the semantic interpretation of the display 
must be compatible or comprehension of the display will be impaired. 
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Applying the analytic scheme 

We again consider first each of the four basic-level constituents, and 
then turn to questions about the organisation among them* litis analysis dif- 
fers from the foregoing ones in an i^ortant respect; The syntactic analysis 
resulted in a rather rich description of the chart or graph itself* This was 
necessary because many of the elements of the syntax fed^into the semantic 
properties, and, hence, we needed to have the chart or graph described in a way 
that would allow us to consider each of the semantic principles, ht the level 
of the semantic analysis, there was much less description per se* And only 
some of the semantics of some aspects of the chart or graph are relevant for 
this later pragmatic analysis* The pragmatic analysis itself, then, produces 
very little in the way of description of the chart or graph* Rather, the exis- 
ting description is now rich enough, from the level at which the thickness and 
color of the lines is noted to the level at which the elements are interpreted/ 
such that we can simply ask questions that probe for violations of specific 
principles* Thus, this analysis consists entirely of questions, as indicated 
below* These are "leading questions 1 * in that the answers reveal violations of 
the operating principles described above* 
The Background 

~*oes the background imply information not explicitly stated in the display 
(e.g., as might occur if the background was a photo of war dead)? 

Are the implications of background material consistent with the message 
and the content? 
The Franework 

Does the form of the framework lead the reader to extract the intended 
message easily? 

Is there a truncated axis? Does this emphasize small proportional 
differences in ways not intended by the graph maker? (Note: sometimes graphic 
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displays make a point in part by emphasizing certain small differences; in some 
cases this may be misleading, in others, not* 

Are scales distorted? Is this compatible with the point of the chart or 
graph? 

Are th lue marking indicated sufficient for intended purposes? 

Are the marks used to represent a given element of such a form that they 
lead the reader to distort relative comparisons? 

If the framework is also serving to depict, does the meaning of the depic- 
tion t.alp or hiivter understanding the content of the chart or graph? 
rhe Specifier 

Are some equivalent elements made to appear more important than others (by 
color, width of lines and so on)? Is this appropriate given the point of the 
chart or graph? Does it help or hinder understanding' its meaning? 

Are marks used to represent a given element of such a form that they lead 
one to distort relative 1 comparisons? 

If the specifier depicts information, does the meaning of the depiction 
help or hinder understanding the content of the display? 
The Labels 

Is the visual dominance and form of the elements of each of the labels 
consistent with the point being made? 

Are some equivalent elements inappropriately made appear more dominant 
than others (by varying color, weight, etc*)? 

Arc words consistent with the terminology of the text? 
General Organization 

Is the meaning implied by the text readily apparent in the chart or graph? 

Does adjacent material on the page distract from or enhance the graph? 

Does redrtfiancy, if present, help or hinder understanding of the graph? 

Is ther^ a deliberate use of perceptual distortion (e.g., of areas)? 
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Ill* TWO EXAMPLES 



In the final section of this chapter, we present two examples of frow the 
analytic scheme is actually applied* In h^tfr example^, we indicate where a 
violation was discovered in the course of ;^nerating the description; viola- 
tions are indica-ted by the word '' VIOLATION " followed by the name of the princi- 
ple violated and the reason that principle was considered to have been violat- 
ed* Note again that not all violations will necessarily impair reading the 
chart or graph at the level of detail intended by the designer* Violations 
reveal difficulties in extracting all of the information potentially available 
in a display, but this may be far in excess of that required to use the display 
as intended* 

The firsti display we analyze is a relatively simple bar graph, and the 
second is a very complex multiple framework chart* Both of these displays were 
taken from u*S* government documents, the first from a Department of Transpor- 
tation manual and the second from a proposed scheme for labeling food products 
from the Department of Nutritional Sciences* in later chapters of this book we 
will not only discuss what is wrong with given charts and graphs, but we will 
discuss how best to correct their faults* Much of the Information necessary to 
correct a given display will be provided in the detailed presentation of the 
various operating principles, as will examples of how thepe principles can be 
ussd to advantage or disadvantage L ^sparing charts and graphs* 



l t Analysis of Figure 2*3 



INSERT FIGURE 2*3 HERE 

SYNTACTIC ANALYSIS 

The following description is for the graph ij 1 is*- in Figure 2*3* 

Note thac if a question in the descriptive scheme - -> clearly inappropriate 
(e*g*, about color when only black and white are used), it is ignored* Simi- 
larly, questions designed for special purpose problems/ such as the relations 
among 2D and 3D depictions, are ignored if the graph includes only 2D informal 
tion, as does this one * 
Background 

Blank white* 
Framework 

There is an outer and inner framework* 
The outer framework 

Elements; 2 vertical straight lines syntactically dense* 

2 horizontal straight lines, syntactically dense* 
Medium weight, black 

Organization: Connected to form a rectangle/ with the vertical axis being 
longer* 

The inner framework 

Elements; 7 straight vertical lines/ syntactically dense* 
Organization; spaced evenly* 
Organization of inner and outer frameworks 

Inner lines cdhnected to horizontal lines of outer framework* terminate at 
those lines* 

The specifier 

Elements: 5 rectangles/ aivided into black and white portions by a 

vertical line, with the left side being black; or * 5 blade 
rectangles and 5 white rectangles* 

VIOL AT IOM ; Principle of Processing Priorities* The width of the bars is visu- 
ally dominant, which is distracting because the width has no information valuo* 
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I 

it ends! 



Organization: Spaced one above the other with the leftmost 

aligned or, the black rectangles juxtaposed to the 
white ones, with the rightmost end of the black 
ones abutting the leftmost end of the white ones, 
and the pairs of rectangles being spaced vertical- 
ly, with the leftmost ends of the black rectangles 
being aligned* 

Labels 

Title : Two fonts are used; Above a large label is a smaller one, part of 
wTuch is a number* 

Key : There is a key; analysis of it is presented in relation to other 
components below* 

VIOLATION : As is evident below, the Internal Mapping Principle is violated* 
It is not clear if the white rectangle in the key corresponds to only the white 
part of the pictorial material (bars) or the ent-lre bar* 
Alphabetic 

IVo sizes of typefonts are used, they will be referred to as either 
"large" or "small*" 

VIOLATION : The Principle of Processing Priorities* The size of the letters 
labeling the two scales is varied arbitrarily, making one more salient for no 
good reason* 



Vertical axis labe-t, small font 
Key labels, small font 
Total distance label, small font 
Horizontal axis label, small font 



Organisation; 



In relation to 
framework, as 
noted below* 



Numeric 



Five in vertical column on loft, large 
font 

Five in vertical column at right, large % v 
font 

Six in horizontal row at bottom* small font 
Number in title at top, large font 



Organization: Rows and col- 
umns, right and 
left columns in 
1 ; 1 correspon- 
dence; in rela- 
tion to frame- 
work and speci- 
fier, as notod 
below. 
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Depictive 



Key: Black rectangle/ white rectangle 

s 



Organization; Adjacent to 



each other, and 
in relation to 
both framework 
and alpha 
labels, a, 
noted be lo&v 




Organization among the different types of labels 



Alphabetic and Numeric 

Lsft: vertical line label above column of numbers* 
Right: total distance above column of numbers* 
Bottom; line label to left of row of numbers* 

VIOLATION Gestalt Principle of Organisation is : milarity) * The size of the 

marks used as labels on the vertical axis and the size of the marks used as 

numbers are incompatible, making it difficult to see them grouped together* 

Numeric Depictive 

No cases * 

Alphabetic and Depictive 

Labels to right of white and black bars* 
vlOL^TIOM t Gestalt Principle of Organization (proximity) , Tfte MPH label is 
not clearly associated wich the vertical scale, being in a non-conventional 
location* 

Organization among the framework, specifier and labels 
Framework and specifier 

Barn abut left vertical line with bars extending to right* 
Bars enclosed in frame* 

vertical internal lines of fraroe do not violate boundaries of rectangles* 

Framework and labels 

Alphabetic 

Title At absolute top, key directly above highest horizontal lin<i of out^r 



t rdfliework * 
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Labels of left vertical straight outer line and bottom line outside frame 
work* Label at top left, at bottom with first letter directly under ex- 
treme point of bottom horizontal line* 

Total distance label at upper right above top horizontal line, centered 
within segment defined by first internal vertical line to the left of the 
right outer line of framework and the right outer line* 

Nune ric 

Column on left regularly spaced outside and to left of leftmost vertical 
line of outer framework* 

Row on bottom under horizontal lower line of outer framework, one number 
under each internal line, no number under last Internal line on the right 
Column on right, evenly spaced, centered between first internal line to 
left and rightmost outer line* 

Depictive 

Above horizontal line defining top of framework* 
Labels and specifier 
Alpha 
No cases * 
numeric 

1:1 alignment of right column of numbers and bars* 
VIOLATION : Gestalt principle of Organization (proximity, similarity, continu- 
ity)* Numbers are not clearly grouped perceptually with appropriate bars* 

Depictive 

Blade and white key labels in same order as black and white portions of 
bars* Hot clear of white box corresponds only to white portion of bars* 

VIOLATION : The Internal Mapping Principle, as noted in the initial comments 

on the key* 

Labels, framework, aid specifier 

Description of the pair-wise relations among the constituents is suffi- 
cient; no special problems emerge from the constituents taken as a whole* 



SEMANTIC ANALYSIS 

r 

We havA new described the basic elements on the page and their organiza- 
tion at a lev i. sufficient to consider how these marks act as symbols* Let us 
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again consider each aspect in turn* We will first begin by considering the in- 



terpretation of the syntactic units just describee. 
Framework 
Outer 

A Cartesi ^ordinate space is defined by tl. - ^rizontal and vertical 
lines * 

The vertical axis represents a ratio scale, with the origin at the top of 
the line* Although this scale is semantically dense, it has been differ- 
entiated into five descrete values with values increasing as one descends 
down the line* 

VIOLATION Principle of Graph Schema Availability* The vertical scale 

violates a common graph form, in which larger values are usually indicated by 

higher marks* The origin of the two axes in a Cartesian space is usually the 

same point (the lower left intersection of the axes), which is not true here* 

The horizontal axis is a ratio scale, with the origin at the left and 
values incr^s^ing as one moves to the right* 

Inner 

The vertical lines mark off increments of distances of 50 feet* 
The specifier 

Length of the entire rectangle represents average braking distance* 

Length of the black portion represents average reaction distance* 

Each rectangle represents a discrete and different speed* 

The relationship between average braking and reaction distance is implicit 
in the relationship between the length of the black and white portions of 
the bars* 

VIOLATION External Mapping Principle* The ambiguity in how to describe the 
specifier on a syntactic level violates the requirements of oar streamlined 
version of Goodman's concept of a notational system, as described earlier* 

Labels 

Elements: 

Alpha 

- - .. << 

English words labeling the values of units on the axis, the meaning of the 
depictive labfl used in the key, and the meaning of the total distance 
column* English words also label the graph as a whole* 
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VIOLATION : The External Mapping Principle* The failure to include the 
word "distance" on the alpha label associated with the left bar in the key is 
misleading as no contrast is intended to the right label* 
Numeric 

Distances in feet and speed in miles per hour* Also total braki * dis- 
tance* The figure is related to textual material by a number at the top* 

\ Depictive 

Color of bars in the key have no intrinsic meaning* 

Organization: 

^ Alpha and Numeric 

Words label scales that the numbers index values on* 

Alpha and Depictive 

p Words label the meaning of the bars in the key via a 1:1 mapping* 

Numeric and Depictive 
No cases* 

) Relationships among the framework, specifier and labels 

Having described the interpretations of the units defined syntactically/ 
let us now consider the interpretation of the relationships among these 
) components* 

Framework and specifier 
Outer framework 

^ The specifier is serving to map discrete values on the vertical axis 

to continuous values on the horizontal one (although both are ratio scales) * 
Two functions are plotted/ and the relationships between these two 
) functions can be computed* 

Framework and labels 
A lpha 

k The labels define the meaning of the -axes* 
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t umeric 

The numbers on the vertical axis serve to differentiate the ratio scale 
into five descrete classes* 

The numbers on the horizontal axis demarcate values on a dense ratio 
scale * 

Depictive 

The bars in the key label and the bars in the framework via a one;many 
map* 

Framework/ specifier material and labels 

The semantic relations are described in the quantitative semantics in a 
straightforward way, as is evident in the descriptions given for the pair- 
wise organization among constituents*" 



PRAGMATIC ANALYSIS 

There are no violations of pragmatic principles evident; seeing the graph 
in context could reveal some/ but we will not consider any such context here* 
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II* Analysis of Figure 2*4 

INSERT FIGURE 2*4 HERE 
SYNTACTIC ANALYSIS 

Analysis int - >charts 

The chart is divided into two subcharts (left and middle) and a cluster of 
alpha and numeric material (hereafter referred to as the right table) * 
The rightmost boundary of the left chart is defined by right justification of 
seven circles and blank space to the right of the circles* The rightmost boun- 
dary of the center chart is defined by annular white space between the small 
radial marVs in the center of the page and the circular justification of the 
alpha ataterial on the right* 
Left Subchart (LS ) 
LS Framework 

The LS framework consists of an outer frame and an inner frame* 
The outer framework 

Elements; A horizontal axis is indicated by the bracket on the bottom* 
Axis is syntactically differentiated* 

Organization: Only one element in outer framework* 

The inner framework 

Elements; Twenty-eight (28) closed curved lines, forming circles* These 
are syntactically dense* Medium weight, black* 

Organization; Circles aligned into columns via proximity* 

VIOLATION ; Gestalt Principle of Organization (proximity)* Proximity 
* 

results in an organization into columns when an orga* nation into rows is 
required* 

Organization of inner and outer frameworrs 
Bracket encompasses inner framework elements* 
Lr> Specif i^r 

Elemnnt^; Black quadrants of circles (i*e*/ subtending 9<T arc)* 
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Organization: Contained within LS inner framework elements* When one of 
these elements appears in a frame, it is positioned in the upper left quadrant* 
As additional elements are added to a frame/ they are placed contiguous to 
prior elements and fill the frame in a counter-clockwise manner* Frames are 
filled from left to right in rows* 

VIOLATION : Gestalt Principle of Organization (good form)* At first glance, 
the inner framework leads one to divide the quantities into fourths/ which is 
incorrect* 
LS Labels 

Only alpha and numeric labels appear - there are no depictive labels* 
Since alphas and numerics appear in the same perceptual units/ separate syntac- 
tic discussions seem inappropriate* 

One typefont (medium weight, black) is used within this subgraph and 
alphas may be upper or lower case* 

Elements; Subchart title - "Nutritional cont*" The first letter is upper 

case/ .remaining letters are lower case, a 
period appears last* 
Organization; Letters have upright orientation and are arranged in two 

groups in a closely packed horizontal string* 
Elements: Seven vertical axis (row) labels are mixed upper and lower case 

with periods and numerics intermixed* 
Organization; Letters have upright orientation and are arranged in one or 
two groups in closely packed hori^ntal strings* Labels 
are left justified at the same column* 
Elements: Horizontal axis label - "needed per day" is co ^ced of lower- 
case letters* 

Organization: Letters have upright orientation arranged in three 
groups in a rlosely packed horizontal string* 
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Organization Among Different LS Label Elements 

LS title is left justified in the same column as the vertical axis labels* 
The space left between the title and the top vertical axis label is only 
slightly greater than the space between the various vertical *' * labels* 
Organization Among the Framework, Z ,ecifier and Labels 

Framework and Specifier 

The dark quadrants of circles are contained within inner frame elements, 
as mentioned above* 

Framework ^nd Labels 

The title is just above and commences to the left of the array of circles* 

The horizontal axis label is below the bracket* 
VIOLATION : Gestalt Principle of Organization (proximity, similarity)* Both 
the position 6f and use of the same typefont for all labels impairs identifying 
the subtitle as distinct* 

Labels and Specifier 

The specifier is not labeled* 
Middle Subchart (MS) 

MS Framework 

The outer framewor k 

Elements: 40 short lines, approximately equal in length* The frame com- 
promised of these elements is syntactically differentiated* 

Organization: The .lines project outward from a common center and extend 
from a common distance from the center and to a slightly 
greater common distance from center* The lines are 
separated by approximately equal angles, but the separating 
angles are discriminably different* 

VIOLATION : Principle of Processing Limitations* That there are exactly 40 

short marks Ln this frame is not immediately apparent, but is Important in 

order to understand the chart* 

T hQ inn ^ r f_ram^work 

None 
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MS Specifier 

Clements: Two "pie-slice" wedges ; one blacks one white* The black is 
slightly larger than the white* 

VIOLATION : Gestalt Principle of Organization (good form)* Failure to include 

the rir.1 the white wedge impairs seeing it as a wedge* 

Organisation: The curved edges of the wedges are contermin . with the 
distal end of the .frame elements* The vertex of the black 
wedge points straightdown while the vertex of the white 
wedge appears to point straight up* The vertices are 
joined* 

MS Label s 

No labels are present within the subgraph* 
Organization Among Different MS Labels 
Framework and Specifier 

Both wedges have vertices which coincide with the center of the circle 
defined by the frame* Both wedges obscure the short radial lines which define 
the frame* 

Framework and Label s 
■ Mot applicable* 

Labels and Specifier 

Mot applicable* 
Ri ght Table (RT) 

RT Framework 

No explicit framework, outer or inner* 
RT Specifier 

There is no specifier in this table* 
RT Labels 

There are both alphabetic and numeric labels in this table* Mo depictive 
elements appear* Two type fonts are used; One is small light upper case, the 
other is largo bold lower case* All letters and numbers in th£ same cluster 
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have the same typefont* Right justification is apparent for entire table, with 
the exception of the digit "8"* 

VIOLAT ION t Gestalt Principle of Organization (proximity, good continuation)* 
The "S" being out of line in the top cluster leads one to focus on*? 1 ? attention 
on it, for no good reason* 



Elements: Three rows of small, upper case type are at the top* Spacing 

divides these rows into two columns* Alphas appear in only one 
(top) string of right column* (numeric "8" is also in right 
column) * Beneath these are three more rows, bold type, in 
lower case* Spa^ _ng again produces two columns* Beneath these 
elements is one row in bold lower case type* 

Organization; Typefont and weight of lines serve to define three groups, 
as noted above* ttie top group is organized into a row or 
one line and a row of two lines (by the Gestalt Law of 
Proximity). The middle group is directly beneath the 
first, being aligned on the right margin* The final line 
is separated from the rest of the table by a large gap* 

VIOLATION ; Gestalt Principle of Organization (proximity) * The large gap 

separating the bottom line of the table impairs one realizing that it belongs 

to the table* 

VIOLATION ; Principle of Processing Priorities* The difference in font size 1 
between the upper and middle clusters direct one*s attention to the middle 
cluster first, instead of the top one* 
Numeric 

Elements: Numerics appear in each cluster* 

Organization; when more than one numeral appears in a string, they follow 
one another in sequence* They appear in the right-most 
perceptual uni_ of the table, except in the bottom line* 
RT Organiz at ion A mong Different Typ esof Labels 

Alphabetic and Numeri c 

Numerics, when present, are intermixed in the same perceptual units with 
aiphabetics * 



Macro-Orqaniaa tion 



Having discussed the syntax of the various subcharts/ we return to overall 
structure of the three* 
Framew , 

Elenents; Two heavy black lj^es composed of a short vertical segment and 
a longer horizontal segment ending in an arrowhead* 

Organization: One line originate** at the center of the rim of the black 
wedge and terminates at the left in an arrowhead, which 
points at the right-most part' of the title of the left 
subchart* The Lower line originates at the center of the 
rim of the white wedge and points at the left-most end of 
the bottom line of the right table* 

Labels of Macrof r am ework 

Only alpha and numeric labels appear - there are no depictive labels* 
Since alphas and numerics appear in the same perceptual units/ separate syntac- 
tic discussions seem inappropriate* 

Elements: Title - the title is comprised of two perceptual units/ one in 
small upper case, one in very large upper case* 

Organization: Upright orientation/ arranged in horizontal strings com- 
prised of two or more closely packed groups* Small type- 
font is centered in the page and above very large typefont* 

VIOLATION t Gestalt Principle of Organization (similarity/ proximity)* The 
title is not clearly identified as such. It should be either set off from the 
chart proper and/or be in a heavier typefont* 
Org anization of Macr of ramework and labels 

Both perceptual elements of* the title are centered above the framework* 

Overal l Organization 
VIO r aTION: Principle of Processing Limitations * There is too much information 
to ss at one*;* 4 
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SEMANTIC ANALYSIS 

Lei^^chart ^ 
Frame work 
Outer framewo rk 

Vertical axis (implied by white space to the loft of the left most column 
of circles) constitutes a nominal scale* This scale is semantically differen- 
tiated (although dif ferentation is de-emphasized perceptually by wider spacing 
row-wise than column-wise, as noted earlier)* Horizontal axis constitutes a 
ratio scale and is semantically differentiated* The extent of this scale rep- 
resent daily nutritional requirement of given nutrients* The brackec functions 
as a way of indicating the scope of the label on the bottom, as will be 
di^cucsed shortly* 

Inn^ r framework 

Each circle in a row may contain as much as 1/4 of the daily requirement 
for a given nutrient* The circles are^thus ratio scales and are semantically 

S 

differentiated* 

VIOLATION : External Mapping Principle* The semantic differentiation is made 
apparent only through the relationship of the specifier with the inner 
framework* The perceptual representation of these circles actually falsely 
suggests a dense scale b/ the lack of differentiation marks on the circle* 
The s pe c ifier 

The basic specifier unit (a black quadrant of a circle) represents 1/16 of 
the daily requirement for a given nutrient* Basic specifier units can be com- 
bined to indicate integral multiples of 1/16 of the daily requirement* 

VIOLATIO N: Principle of Schema Availability* The nesting of quadrants within 

_ ^ 

each of the four circles is a nov^l way of specifying the information, and 
hence, must be clearly specified* 
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L abels 

Elements: 

Al phabetic 

English *s are used in the t* ' le to Inform the reader that the subchart 
provides information on nutritional <■* stents* They are also used to name the 
various nutritional components represented as rows of circles and to inform the 
reader or the meaning of the horizontal axis* 

Periods {*) inform the reader that a sequence of letters is an abbrevia- 
tion of an english word* 

Numeric 

Numerals appear as characters which, in part, for^ the names of the nutri- 
tional components * 

0 janization ftmong f ferent Types of Labels 

Alphabe ti c and jjumeric 

Together comprise names* 
Relations hi ps Among the Frame wo rk, Specifie r , and Labe ls 

Ou te r_f r a m e wo r k and inne r fr ame work 

The bracket can be interpreted £s unifying the collection of four circles 
into one dimension (along the horizontal axis of the inner framework) * This is 
a Gne:Many mapping. 

Framework and Specifier 

The basic specifier units (black quadrants of a circle set ir* conjunction 
with tfu* four circles in each row to indicate the extent to which one serving 
of the food Lttim satisfies the daily requirement for a nutritional component 
associated with the row* 

Fra mewo rk and Lab** I s 

Alphab etic 

The alpha labels defuw the meaning of the axio. The bracket inckcat*v. 
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that the horizontal axis is defined by the English words immediately beneath 
it* This is a One:One mapping* 
Numeric 

Act in concert with alpha to name nutritional components represented by 

rows * 

F ramework and^Speci f ier 
The specifier is not labeled directly* 
M iddle Subchart 
Framework 

This framework is ambiguous^ Ifte only interpretation that is consistent 
with the other subcharts in the display is that this one framework represents 
two distinct entities* One entity (the top part) is the total daily nutrition- 
al requirement f or a person* The second (the bottom part) is the total daily 
caloric requirements for a person* 

Accepting these interpretations r the framework , would constitute a ratio 
scale* 

While the £rame appears syntactically differentia ted, on the semantic 
level, the issua of denseness and diff erentation appears completely indetermi- 
nate in the context of all information present or derivable* 
VIOLATION : External Mapping Principle* The ambiguity mentioned aoove is due 
to faulty mapping from syntax to semantics* 

VIOLATION : External Mapping Principle* ifte variation in spacing between the 
marks of th*> Ltame seems to have no meanxng* 
The Sp eci fier 

The black wedge represents the proportion 0 f the total daily nutritional 
requirements supplied by a serving of th* food in question (This interpretation 
the only one consistent with the connective relation between the black wedge 
and tte i ft ft ^ub^raph,) 
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The white wedge represents the proportion of the total daily calo.ic re- 
quirements supplied by a serving of the ¥ood in question* This interpretation 
is uncertain, however, but is suggested by tne fact that, the arrow from it 
points to the bottom line of * table on *-'ie right* 

VIOLATION : External Mapping K-riciple* l. ^ meaning of the wedge simply is not 
clearly defined on the syntax or the semantic context, allowing one to inter- 
pret the meaning of the syntax in more than one way* 

VIOLATION : Principle of Graph Schema Availability* A circle or "pie" chart is 
usually used to show how a whole is divided into parts* Hie middle subchart, 
on the other hand, does not use wedges to divide a single entity into parts, 
but rather treats the two wedges as independent* 
Labels, 

Ho labels of any sort are wholly within subchart* 
V IOLATION : External Mapping Principle* Hissing labels on both the framework 
and the specifier make this chart very difficult to understand* 
Relations hip Be t ween the Framework and; Specif ier 

According to the most consistent reading, the specifier elements represent 
two distinct entities: (1) proportion of daily nutritional requirement supplied 
per serving (black wedge), and (2) proportion of daily caloric requirement 
supplied by a serving (white wedge) » The frame represents the whole daily 
requirement of these two entities (nutrition and calories) and, therefore, sup- 
plies ratio scales in which both specifier elements are measured* The differ- 
ent sizes of the two wedges is thus explained* 

VIOLATION: External Mapping Principle* if this interpretation is correct, the 
scale is different things to different objects, and therefore, violates the 
disjointness property required for systems of symbolic notation to be unam- 
biguous* 



70 



VIOLATION : Internal Mapping Principle* The wedge-shaped specifier elements 
obscure the hash marks which comprise the outer framework* This prevents any 
quantitative mapping from specifier to frame* 
Frame work a nd L abel s 

The frame is not labeled in this subchart* If it he oeen, two different 
labels would have been required for the same framework or the framework would 
have to be divided into two semicircular frameworks, each separately labeled* 

Labels and Specifier 

The specifier in this subchart is not labeled within the subchart* 
Specifier elements within the subchart are connected to labels in other 
subgraphs and derive meanings thereby, as will be discussed shortly* 

Right Table (RT) 



,RT Franework 



There is no actual framework* 

RT Specifier <> 

There is no specifier* 

RT Labels 

Alphabe tic 

The labels in the upper cluster are English words which specify quantities 
of food- The labels in the middle cluster are English words for abbreviations 
which art; names of nutritional components of foci* The symbol "g" indicates 
"grans * H 

The lower label is an English wor'J meaning a unit of heat (in this con* 
text, the hoat equivalent of a serving of food) * 
Uun^r ic 

The numerics are arabic numerals specifying quantities* 
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RT Organ iza tion Among Different Types of La bels 
Alphab e tics and Numeric s 

Alphabetics and numerics appearing in the same perceptual units together 
specify a quantity of: some type of physical un i ^ (e.g., "4 grams"). These 
units in turn specify how much of the named ^ Lance associated with the quan^ 
tity in a serving. 
Macro-Organization 

Framework 

One arrow associates the white wedge with the "170 kilocalories" label. 
This, in fact, allowed us to infer the meaning of the white wedge. 

The other arrov associates the black wedge with the entire left-most sub- 
chart, which provides an analysis of the total daily requirement of the nutri^ 
tional components. 

VIOLATION i External Mapping Principle. The lack of labels on the arrows im- 
pairs one from realizing that they symbolise different relations, "decomposes 
into" (top) and "corresponds to 11 (bottom). 

VIOLATION : Principle of Graph schema Availability. Arrows point from speci- 
fier elements to labels in place of the more conventional directions from label 
to specifier elements. 

Labels of Macro-frame work 

Elements : 

Alpha and N umer_ic_ 

The title identifies this display as the third in this chapter, and labels 
the infornatxon provided by the entire set of charts. 
Overall O rganiz ation 

VIOLATION : Internal Mapping Principle. Labels are missing that are necessary 
to^cooedinat^ the subcharts into a single cohesive display. 
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VIOLATION : Internal Mapping Principle* One cannot easily relate the in- 
formation about protein in tht, jht table to the information about protein 
in the left chart, partly because of the use of "prot*" and "orotein" in 
the different subcharts* In general, use of different notations or 
abbreviations lends one to infer that different things are being talked 
about* 

VIOLATION : Principla of Internal Mapping* One must realize that there are 
forty marks comprising the frame elements in order to construe a consistent 
relation between the left and middle subgraph (in terras of nutritional 
content)* The marks should have been emphasize- (e*g*, every tenth made 
bolder) to facilitate this realization* 



PRAGMATIC ANALYSIS 



There are no clear cases where the display has been slanted to lead us to 
draw incorrect inferences or attend to specific pieces of information more than 
others* We cannot know whether the pragmatic principle of contextual 
compatibility is violated because we do not know the context in which the 
display occurred* 
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CHAPTER 3: SYNTACTIC PRINCIPLES 



I* Seeing the lines 

1* Adequate discriminability 

a) Relative distinctions 

b) Detecting marks 
2* Perceptual distortion 

a) Optical illusions 

b) Systematic distortion 
II* Natural units 

1. Gestalt laws of organization 

a) Good continuity 

b) Proximity 

c) Similarity 

d) Good form 

2* Integral /separable dimensions 
III* Processing priorities and limitations 
t * Priorities: salience 

a) weight and noticeability 
2* Linitations: fixed capacity 

a) 7+2: "finite capacity;" 

b) Comparing units or parts thereof 
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CHAPTER 3 j SYNTACTIC PRINCIPLES OF CHART AND GRAPH CONSTRUCTION 



In this chapter we begin to consider principles that must be obeyed if £ 
chart or *graph is to be readily comprehensible* The principles specifically 
addressed in the present chapter concern how lines on o. page ire s*#en, organ- 
ised, and held in mind* In the next chapter we will consider how such patterns 
are interpreted as meaningful units and how conceptual and quantitative infor- 
mation is extracted from them* In Tsoth this and the following chapter each of 
the principles we present is illustrated by at least one "before and after" 
pair of displays, demonstrating how a violation of the principle clearly 
impairs graph reading, and how such violations can be repaired, thereby improv- 
ing graph reading* Thus you* the reader, are in a sense a subject in an infor- 
mal experiment; if you clearly agree that our repair of the "before" graph 
improves its legibility in your eyes/ we may take it as prima facie empiiical 
support for the validity of the relevant principle* This methodology has been 
employed successfully in the study of linguistics and in the study of percep- 
tual illusions constancies, and organising principles* In addition to these 
demonstrations, in the sections to follow we summarise the available empirical 
findings— in the literature at large and on charts and graphs in particular — 
that bear on each principle and we present new data bearing on each principle* 

The syntactic operating principles all rest on facts about how w & see and 
encode visual information* Thus/ the support for these principles is of two 
kinds, direct investigations of chart* and graphs per se and more general stud- 
ies of hunan visual information processing* The relative paucity of research 
on charts and Graphs is more than balanced by the richness of our knowledge 
about visual perception* Hence, we are in a position to formulate the syntac- 
tic principle with a hi-jh degree of confidence* m **ach case we can not only 
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marshall evidence that t*ie principle is correct, but provide details about how 
to avoid violating the principle and how to uake use of it in effective presen- 
tation of information in graphic displays* 

In the remainder cf this chapter we will consider *■" **ee general classes of 
principles* The first class of principles must not be violated if the lines on 
a page are to be seen correctly* These principles deal with the acuity of the 
visual system and with the way in which the lower levels of the visual system 
systematically distort the simple attributes of what we see* The second class 
of principles specify the factors that determine how we group marks into Jnits* 
These grouping principles are especially important because they determine 
whether the basic-level graphic constituents and relations among them (e*g*, 
which part of the display is labeled by a ^ven word) will be detected easily. 
The third class of principles outline factors that determine the priorities an<3 
limitations of visual processing* These last principles deal not with percep- 
tual processes per se, but rather with the process of encoding information into 
memory* In particular, we consider the limits of "short-term memory", which 
place real constraints on how many uni :s a graph maker can sonsibly expect a 
reader to process at one time* 

Thus, in this chapter we trace the path of visual processing of a graphic 
display, beginning with very low-level physically-defined attributes aid ending 
with attributes that are fairly removed from the eye and visual r stem per se 
and more closely linked with abstract conceptual thought* This path, from 
outside to inside, will be further charted in the ensuing chapter when we leave 
the realn of perceptual processing altogether and consider the linguistic and 
conceptual undorpi.nings of graphic comprehension* 
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I* Seeing the Lines 



Two general principles codify factors that affect how well we see the 
lines that comprise a graphic display* The principl e of adeq uat e discri min- 
abxlity specifies the size of the dxffei ..ce between two marks that is neces- 
sary for us to detect it, and how pronounced a mark must be to be seen at all* 
The principle of perceptual disto r tion specifies how the visual system syste- 
matically distorts some visual dimensions, leading us to make increasingly 
larger errors when comparing marks of larger magnitudes* 
1* . he Principle of Adequate Discriminability 

There are four different ways in which visible marks can vary., and associ- 
ated with each are many different dimensions that potentially may be used to 
code information* First, a mark may vary in its quality * For example, differ- 
ences in color or visual texture of a particular mark can convey information* 
Similarly, the position of a mark on the page may be informative* Second, a 
mark may vary in intensity * Brightness, lightness/ and density or numert.sity 
are dimensions along which intensity of a mark may vary* Third, a mark may 
vary in its extension , such as its length, area or volume* Finally, a mark may 
vary in du ration , which may be important in dynamic displays such as Traffic 
Situation Displays (warner, 1969) and so-called "kinostatic" or time-varying 
grapho discussed ~n Biderman (197 1 ) and Warner and Thissen ( 1 931 ) * 

If a mark is to map uniquely into its corresponding "compliance class" at 
the semantic level of description, variations along any dimension must be per- 
ceptually different* That is, the reader must; be able to detect differences in 
magnitudes of intormat ion-conveying marks* Thus, good graph making will bo 
aided by data on human abilities to detect and discriminate variations along 
the physical marks* For example, data on the smallest point or difference in 
length that a person can detect under normal viewing conditions will help 
ensure that displays are legible, especially if the original display is reducod 
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in size for publication* In this case, the data define limits in our ability 
to make absolute discriminations , to detect the presence of a mark* Similarly, 
if a comparative judgment is to be made of differences among marks, then data 
on minimum perceptible differences are necessary to ens>" **hat there will be 
no ambiguity in difference judgments, including cases in which the graphic 
display is reproduced at different sizes* This corresponds to limits in our 
ability to make relative discriminations* 

■ . / 

The limits in our abilities to make discriminatory are consequence of 
the nature of our perceptual system/' Forexample, before the physical mark can 
even affect the sensory receptors- cf the eye (the "rods" and "cones"; see Kling 
and Riggs, 1971), it must be projected onto the retina* This projection is 

i 

accomplished by the refractory properties of the lens of the eye and changes in 
these refractory properties causea by accommodation. Because of factors such 
as optical def ects t deviation from sphericity of the refracting surface, 
scattering, and wavelength-dependent properties (Field and Magoun, 1959), the 
quality of the retinal image is necessarily degraded, limiting uhe resolving 
power of the visual system as a whole* This degraded retinal image is 
transduced by the retinal cells into a frequency code of all-or-none action 
potentials which are then transmitted via the optic nerve lateral geniculate 
nucleus of the thalamus* and optic radiations to the visual cortex for further 
processing* Anatomical and physiological properties of the receptors " 
themselves (Abramov and Gordon 1973) and mechanisms of neural transmission and 
decoding (Aidley, 1971) contribute to further limitations in our ability to 
detect <ind discriminate variations in the physical properties marks* 

Heurophysiological phenomena allow us to explain* in part, some of the 
reasons for finite discrimination, but because of their complexity and our 
limited unders tending of the mechanisms involved, we cannot yet **se them to 
explain all the perceptual data* It thus becomes necessary to analyze perfor- 
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mance at the level of the entire visual system, especially because the behavior 

\ 

of the sy^tciu as a whole is of prime concern here* Ons way of p£is>ceedintj at 
this level xs to treat the human as a measuring instrument for visual inputs 
and *■ Vscribe the performance of the input-output behavior of this instru- 
ment* For absolute and relative discrimination tasks, the inputs are marks 
varying along any dimension, and the output is the response of the individual 
to questions about the presence or absence, difference or sameness, of the 
marks* Data are then obtained by varying the magnitudes along particular 
dimensions and noting the minimum variation that elicits a qualitatively 
different response* 

In the remainder of this section we consider these two topics, absolute 
and relative discriminations* For each topic, we briefly discuss the concept 
of threshold and how it may be measured* Then we present data on thresholds 
for various physical dimensions, as well as contextual factors which influence 
these thresholds* At the same time, we present examples illustrating how the^ 
data may be exploited in designing unambiguous graphs. 

Absolute Discrimination 

Threshold Determination * 

The relevant research on our ability to make absolute discrimination 
hinges on the notion that there exists a fixed sensation magnitude, or thres- 
hold, below vhich a stimulus is never detected (sensed), and above which a 
stimulus is always detected* If a series of stimuli are presented with magni- 
tudes near the threshold, there should be a wel\ ^fined separation of those 
stimuli that are sensed and those that are not sensed* Th* point which divides 
stimulus magnitudes into those which are "sensed" and those "not sensed" is 
called the absolute threshold* 
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Althorgh the absolute threshold is theoretically lixed at some point on 
the stirmlus magnitude ct luum, measuring such a threshold is in no way a 
trivial task. The measuring instrument is a human subject whose responsa is 
not completely predictable or reliable. The effect of this Is t v t the 
threshold is obscu 2 d by the "noise~produci ng " \ ar lab! li ty t And h ~a tis t ical 
measures nu^C be ustfd to extract the actual threshold. We will not discuss 
^ re the actual procedures used to compute the thresholds; the interested 
reader is referred to Lrce and Galanter t l 963 ) . Furthermore, the very assump- 
tions about the existence of a fixed threshold and the proper way of measuring 
it have beer called into question, and modern researchers use the more sophis- 
ticated assumptions and techniques of the Theory of Signal Detection, which 
assigns a central role to the inherent statistical variability of the visual 
system and to the biases and motivations of the perceiver (Green and Swets, 
1966)* However, for our purposes, which are to glean rough estimates of the 
resolving pow^r of the visual system for us~ in the design of readable charts 
and graphs, we may innocuoasly adopt^'th^ "classical" assumptions about sensory 
thresholds * 

The two po*>- important thresholds for graph construction are visual acuity 
and contrast * Data on acuity and contrast are important when considering legi- 
bility of laoels and pictorial material, as is described below* 
21 ' Acuity 

Maxinin visual acuity is defined as "the smallest visual detail that we 
ar^ capable of revolving at a specified distance* 11 Visuil acuity is expressed 
by tho visual w^** in minutos of arc subt^ndwd by the physical stimulus s* 
}\>r a rou-jb *tnvz* of the nteasjrer hold out vour thumb at arm's length/ it suo- 
U*nJ> ,K)v;' 2° **isual rirc. Ther»3 arr> 60 minutes ber degroo of arc. For 
sritl an^l**s, vt'iual \x, [m .unu'oi; of arc) 10 computed dttiruj tho fol jwitv? 
t j r «^u La ; 
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e m (57.3) (t>0) S (1) 
D 

where S ls the physical size of the mark referred to as the distal size a id D 
is the distance from the eye to the mark* The constants in the formula convert 
the units ual angle from radians to minutes of arc* 

For present purposes, it is critical to note that detectability does not 
ensure legibility. Identifying a marK as being of a particular type is more 
difficult than merely noticing that some figure is present. The literature on 
legibility is well documented in the Human Fac ors literature (see Smith, 
\973) t and standards such as Military standard 1472B (1974), established by the 
Department of Defense, are routinely available* Table 3*1 sumarizes some of 
the recommendations for sises of display letters* As a rule of thumb, unde- 
normal viewing conditions one can assume a standard acuity of one minute of a-c 
(Thomas, 1975)* Given this specification, in order to recognize the details of 
the capital letter "E", for example, its vertical size would have to subtend at 
i'.ast two minutes of arc, one for each pt --r of it% horizontal strokes* 
Hov/ever, a "standard" acuity of one minuto of arc corresponds to a detection 
probability of only seventy-five percenti if near-certain detection is wanted, 
1*6 minutes of arc should be specified, making our letter "E" subteM about 
four minutes of arc* This corresponds to 0*021" seen from 18" away* 

INSERT TABLE 3*1 HERE 

Lee us ^oruiiier an example of how we might use the data on visual acuity 
to specify the type font necessary to ensure adequate legibility of the label 
AliSLf; for a display reduced by a factor of 2:1* If wo as^um^ a normal viewing 
distance? <>f 18", then a, 2;1 redaction results in ?n equivalent viewing distance 
of 36", via previously determined that recognition of the letter "E" required a 
irininun ot A minutes of Therefore, at a viewing distance of 36" tht* 

re-juir^d typ* font mu<*t b^ 0*17" using equation And m fact, research on 



81 



m 



© 



reading has shown that character sizes should be between .06" and .17" for 
maximum legibility (Spencer, 1969)* 

As an illustration of how the principle of adequate discnminabili ty can 
be violated by a graph, consider the set* of graphic displays shown in Figure 
3*1 taken from an article by Wic" *is and Kessel (1977)* At this level of re^ 
duction, the labels "Kits" and "Misses" associated with the key subtend a vis- 
ual angle of approximately 4*5 minutes of arc* But tj identify the letters 
correctly 75% of the time the visual angle must subtend at least five minutes 
of arc* Thus, these labels begin to violate the boundaries of our identifica- 
tion abilities and at a normal viewing distance of i8 M the reader will notice 
that it does take some effort to make accurate? identification* Compare this to 

1 

the improved version on the right; this should be much less work to read* 

INSERT FIGURE 3*1 HERE 

Luminance 

The trend to computer graphics has led us to consider luminance as an 
important contextual parameter affecting acuity* Luminance is the amount of 
light per unit area reflected from or emitted by a surface (this measure is 
frequently referred to as brightness/ although brightness is the subjective 
sensation to changes in the physical energy of light)* Luminance is expressed 
in a variety of units for which conversion factors are given in Table, 3*2* The 
thrae pc^ferr^i units of luminance are the Lanbert, Millilambert aftd the Foot- 
Lumber t * 

INSERT TABLE 3*2 HfcRE 
The L.inb^rt (l) is defined as the unit of luminance equ f >l to that of a 
perfectly diffusing and reflecting surface illuminated by a standard candle at 
a di^tan^c ot one centimeter (cm)* Hie Mi Hi lambort (ml) is one thou^,mith ot 
a Larub^rt* Th*> Foot-Lambort (ft-L) i~ defined as the unit of luminance eq-J.il 
"tu that i£ a perfectly diffusing and reflecting surface illuminated by onr* 
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foot-candle. Normal reading light is "bout 10 ft-L* The luminance values 
experienced in a number of common situations are given in Figure 3*2* 

INSERT FIGURE 3*2 HERE 

No-e from the ^Lgure that as we move frc to high luminance levels/ we 

move from "rod" to cone" vision* Rods and cones* the tvo types of photorecep- 
tors found in the evti, differ importantly in their spatial distribution and 
functional property** Basically, cones provide <acute vision during daytime 
luminance levels/ whereas rod vision is most sensitive to lov luminance levels 
and is essential for night vision* These. and other important function dif- 
ferences *are summarized in Table 3*3* 
' INSERT TABLE 3*3 HERE 

Visual acuity is highly dependent upon the background luminance on which a 
dark detail is superimposed* Figure 3*3/ taken from a study by Moon and 
Spencer (1944), shows the relationship between acuity and background luminance* 
As luminance increases/ acuity increases — partly because the co js become 
active and, as Table 3*3 indicates, the spatial resolution of cones is much 
greater than that for rods* For normal reading light (about 0*1 ft-L), the eye 
can detect an object subtending about 1 minute of visual angle* * 

Ib^tiRT FIGURE 3*3 HERE 

Contrast 

A second factor that must be considered if a display is to be legible \s 
oar ability to discriminate displayed detail from visual background or to dis- 
criminate contrast in brightness* For details darker than their background 
(conrconly the o^ne for cJ* a P nic displays)/ contract ca* 1 vary from 100 percent 
positive, to zero* 



Contr^-it n a irw^aure of <Jifff;renco in luminance between a detail *L1>> ant its 
foac><jro>in-J ILh) ,ind is computed by the formula; 



Contract (%) * LB - LD X 100 



(2) 
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One empirical approach for determining the Limits of this ability is to 
determine the minimum contrast needed, to perceive a particular pattern* The 
sinplest type of pattern is a grating made up of a series of light and dark 
bars* If the Luminance difference between the light and dark bars is reduced 
suf fxciently, there will be a point at which they are just g riminable* The 
point is called the contrast threshold; the lower this threshold, the greater 
the contrast sensitivity* 

Our visual systems do not have a single contrast threshold for all stimu- 
li* Rather, our contrast sensitivity differs depending on the sharpness or 
gradualness of a luminance change, being highest for intermediate degrees of 
gradualness and lower for extremely gradual changes and for extremely sharp 
changes (i*e*, fine details)* This relationship was discovered by observing 
the contrast threshold for grating patterns of different degrees of fineness* 
If the firte^_ss of the grating is expressed as cycles per degree (number of 
ligit dark pairs subtending 1° of visual angle), then for gratings of any fine- 
ness* the contrast can be varied to yield the contrast sensitivity* ft plot of 
this sensitivity for gratings of different spatial frequencies (fineness) can 
then be obtained and is referred to as the contrast sensitivity function 
'(Campbell and Robson, 1968)* Campbell and Robson obtained contrast sensitivity 
functions for many grating nypes, two of which are shown in Figure 3*4* Note 
th-?, ^t intermediate frequencies (changes from black to white), , less than 1 
percent contrast is needed to resolve a patter*.* This is true when sensitivity 
is measured by varying the contrast of a "square wave 1 * grating (black and white 
solid 'itnp^s, witt >harp edges) or by varying the contrasts of a "sxn« wave" 
^atir.7 Mirk and li<;ht stripes that fuzz into each other). 

INSERT FIGURE 3*4 HERE 

To v h«* Htir^ct of contract on acuity consider the c iphs shown in Fig- 
i:^ 1*5* Th" Ffjuro sh^s t i^ntical graprn reproduced unt'*^, Jit'fer^nt. conJi- 
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tions such that one is superimposed on a 'grayish 1 background, the other or a 
■white 1 background* One can clearly see that it is more ditficult to identify 
the labels on the gray background than it is to identify the labels on the 
white background* If we assume that black print on a gray ' -ckground results 
in a fifty percent contrast reduction relative to black prir*t on a white back 
ground, and if the graphs are read in nornal reading light (10 ft-L) at a,/nor- 
nal reading distance, then equivalent identification accuracy is achieved by 
increasing the size of the 'black-on-gray 1 font thirty-one percent (see Figure 
3*5)* That is, if the size of the label ^vehicle' for the 'black-on-'^hite 1 
font is 0*025 inches, the same label must be 0*033 inches if superimposed on a 
gray background (using the daca in Figure 3*5b)* 

ItJSERT FIGURE 3*5 HERE 
The effects of contrast are acute in news magazines because their emphasis 
on "attractive" graphics often results in displayed material appearing on 
colored or patterned backgrounds* ' Observing the following rul^s (Grether and 
Baker, 1972) will help to increase identification against nonuniform back"* 
grounds* First , choose a color and luminance that contrast most with the 
colors in the background* For example, a green trend line on a green back- 
ground will be less discriminable than a red line on a green background. 
Second, pick light colors for specifiers on dark backgrounds and vice versa* 
This is necessary because color contrast is not sufficient to ensure legibil- 
ity* a ligh-n*ss contrast is far more important (Tinker and Paterson, 1931; 
Poulton, . 

We, hav*> rc^ntioneJ that humans are most s^nsitivo to intermediate decrees 
of gradualisms of luminance gradients across the visual field* As no ted * this 
causes ^nall details to be less resolvable at low contrasts than larger 
(W.ail^, Ho^>v^r, th^re is a less Obvious corollary of th£ visual contract 
sensitivity function; very gradual change in lightness will be hard <3otoct 
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at low contrasts as well* this means that topographic maps and other displays 
that vary shading continuously across the page niay have to use large contrasts 
if it is desired that the viewer detect gradual changes* For example, in the 
left panel of Figure 3*6 > change in rainfall across the Great Plains is 
difficult to detect at the contrast shown; the right panel repairs the 
prob lem* 

INSERT FIGURE 3.6 HEBE 

Relative Discrimination 
The acquisition of information from charts and graphs often requires one 
to judge differences in two magnitudes on a single dimension* For example, to 
acquire information fron a bar graph requires that we be able to judge the 
length of bars* A fundamental question here is how small a difference can be 
and still be detected* This difference is called the just-noticeable differ- 
ence t or JttD* For bar graphs, then, this means that there will be some minimal 
difference in the lengths of the bars below which we will be unable to detect 
differences in length, and thus we will be insensitive to information repre- 
sented by such differences* Oar sensitivity to differences in magnitudes 
varies fron dimension to dimension and is influenced by the context in which 
the mark is viewed. In the remainder of this section, we present data on sen- 
sitivities to various dimensions and also discuss some of the contextual 
effects* 

Difference thresholds are obtained by asking people to compare a test 
stinultrj to a standard and noting how small the difference in roagni tui»"? can be 
while still being detected* It is a noteworthy fact about human perception 
that the-ie threshold depend on the magnitude standard stimulus* For example, 
using out b^r grafch illustration* if the length of: a b^r (the "stan^arJ" ) wa ( > 
0*1", then very smalt differences in lengths of another oar \the "conparison" ) 
stimuli (say, 0*001") quite possibly would be detected* If* hoover, thr> 
standard were tO^O 11 * then differences of 0*001" would probably never be 
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noticed* Therefore, if difference thresholds are to be useful parameters for 
guarding against ambiguity, the dependence of threshold and standard must be 
kept in mind* 

E*H* weber formulated a famous law capturing this de q ence in 1346* He 
showed that the change in stimu. **s magnitude (AS) which was needed to trigger a 
just-noticeable change in perceived magnitude along any dimension was a con- 
stant fraction of the magnitude of stimulation (s) already experienced* 
Weber's Law means, for example, that if the proportionality constant for bar 
length was 0*0i* th^rv for a standard of 0*1", a comparison stimulus differing 
from the standard by 0.001" would be detected. For the 10*0" standard, the 
difference must be 1*0"* Table 3*4 lists proportionality constants or differ- 
ential sensitivities for visual dimensions typically found in graphic 
displa ys* 

INSERT FIGURE 3*7 AMD TABLE 3*4 HERE 
The advantage of the Weber fraction as an indicator of differential sensi- 
tivity is Its independence of the actual units of measurement. For example, it 
does not matter whether size is measured in inches or centimeters* since both 
increnent is and actual stimulus magnitude s are measured in terms of the same 

^Weber's Law is expressed asj 

AS = KS (3) 
where o£ is the ji'st noticeable difference (*7ND) * The differential sersi tiv - 
ity to any dimension is obtained from equation (3) by creating the relative 
quantity :,S/S, called the W*ber Fraction; 

K - AS/S (4) 

Theoretically, wh^n VJeber's law is correct* a plot of AS versus S results 

is a constant as shown in Figure 3*7, with greater ordinate values imply- 

ing less sensitivity while smaller values indicating greater sensitivity. 
Howevor, when empirically tested for most sensory modalities, the dashed curve 
in the figuro usually results* ht the point s o , we^er 1 ^ L*w, a s written in 

equation 3, is no longer valul* To cope with this dip in sensitivity 
occurrm j near the absolute threshold, alternative laws (Miller, 1947, 
Guilford, 1 )Z2) hav* boon put forth* Miller intro<Juc*vl whjt now has bf»rron^ 
known as the l^n^ralized Weber's Ljw. 

AS ^ KS + a 

where the constant a is proportional to the absolute threshold* For most 
intermediate ran<je stimuli, though, Weber's law holds quite well* 
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physical quantity, leaving K as a dimensionless ratio* This allows as to com- 
pare relative sensitivities for different physical dimensions* 
Discrimination of Size 

These are numerous different ways of measuring size, each of which will be 
* 

consiQ -*d below* 

Length discrimination 

Length is a commonly used dimension for coding information in graphic 
displays* especially in coding •'point 1 ' information as bar graphs do* Our abil- 
ity to discriminate differences in length is especially important if one must 
make comparisons across graphs with multiple frameworks* Consider the 9raph 
shown in Figure 3*$a, representing yearly fire and police expenditures for some 
fictitious city* Suppose we are interested in kr.owing whether fire and police 
expenditures were the saae for the year 1978* Answering this question requires 
a ccmpariscn of the two bar lengths representing these magnitudes* In fact, 
police expenditures were greater than fire expenditures for that year* How- 
ever, the difference in the two bar lengths is less than a JHD ^, and we can see 
that it is quite difficult to note the difference reliably (without perhaps the 
use of a ruler)* Figure 3*8b shows the sam# information, but this time, the 
difference in length is greater than a JND**, and it appears much easier to noto 
the difference in lengths* 

INSERT FIGURE 3.8 HERE 
Ono investigated the applicability of Weber's Law for line lengths, 

length being specified both in terms of "physical" size and size of th<* invigo 



'if we ,is^unri that the bar on the left is the "standard", its length is 1*7"* 
Tho lenith ot thf; comparison bar is 1*762S", res King in a diff'>renc»> in 
lengths of 0*rK>25"* By Wober's Law and tho diff, *nti,il somitivity to lin^ 
li.nqth of <i*l*, discussed below, the required JHD is: 

AS - 0*041 (i *7; 
= 0*0697" 

*Th<> "qtandari" lunijth is 1*7", and th* oorcp-inson btiwth is 1*77", tho ciifiur- 
^nce Miivi u*o7". Tim; difter^nce is <jr*Mt<*r tnan tho r.^]uircd Jt;l>* 



projected onto the retina as measured by degree of visual angle* He found that 
the value of the JHD was predicted equally well for both specifications of 
size. His results indicate a measure of differential sensitivity to line 
length of 4*1 percent in terms of either size measure* This means thrit if one 
line length is specified at 1", a second Hue - ba specified at l rl" to be 
just noticeably different* Ii» terms of the "retinal" size, if one line subcends 
1* of visual angle, the second must subtend 1*041* to be perceived as just 
different* 

Orientation Effects * The difficulty of discriminating length is doter- 
mirted, in part, by tne orientation of the lines* Consider the graph shown in 
Figure 3,9a* Suppose a reader is required to make a comparison of the lengths 
of lines representing the A-C Link £#5) and the G-H Link £#6)* Perceptually, 
the lines appear equal in length* Wow let us orient the a-C link in the hori- 
zontal position, as shown in Figure 3*9b* We can now clearly see that the G-H 
link is greater in length than the A-c link* 'Ihis example demonstrates that 
differential sensitivity is better for horizontal lines than oblique lines* 
The same effect is also true for lines oriented vertically* The source of this 
effect is not optical (Mitchell, Freeman, £ Westheimer, 1967), but appears to 
be sonew^re within the neural mechanism involved in spatial resolution (Maffei 
& Canpb^ll, 1970)* We know of no generally applicable quantitative standards 
concerning the raco of change of differential sensitivity as a function of 
orientation* but Fiqure 3*9 doe*; suggest, qualitatively, the direction of this 
ctvm 

INSEkT FIGURE 3*9 IISkE 

Ar^ j termination 

Area ii ^crimination, oft^n require tor prows -sing inforrution found in 
" :r ot" nij^i- Thi ; tyj^ of Graphic display is oftmi us^-i to represent th+i nuu- 
\^r t f r**'j'i»*nci"^, density and tho iik^ of vari\t>U>£ varying q<*o^jrap!u^aily* Ar, 
iu <-,<dfftj)i^ A ,ir*vj discrimination, ^onMder tho. "spot" nap in Ki^um 3 * 1 f >, i 
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showing "technology manpower 1 ' for different regions in the United States- 
Manpower ls coded in terms of areas of circles: the larger the area, the 
greater the manpower. Thus, to process the information, we must be able to 
discriminate between areas. Suppose we wish to compare manpower between the 
northeast and Far viest regions. If one were actually to me* , .re the diameters 
of these two circles, one would find that the circle representing manpower in 
the Far West has a greater area than the circle representing manpower in the 
Hortheast. However, the areas do not differ by a JND 5 , making a visual compar- 
ison very difficult, if not impossible. The same graph is redrawn in Figure 
3.10b with the areas now differing by more than a JND 6 , and it is now possible 
to see the difference. 

Baird (1969) has reported a differential sensitivity for area of 6.0 per- 
cent. This value implies that for differences in area to be detectable, the 
areas must d" ffer by 6.0 percent or more. 

INSERT FIGURE 3.10 HERE 
Discrimination of Number " * 

"Nunerosity" refers to the subjective impression of the number of objects 
that a person can see in the visual field without counting the objects. Our 
ability to discriminate differences in the number of objects (e.g., dots) 



*The area of the circle representing manpower in the Far West is 0.785 in^, 
corresponding to a radius of 0.5". By Weber's Law using the value of differ- 
ential sensitivity of 0.06, discussed below, the area of the second ^i rcle 
should differ by: 

AS = 0.06 (0.785) 
- 0.047 in 2 

Thus, to be 3ust noticeably smaller, the area of: the second circle should bo 
0.733 in^ or ltfss/ corresponding to a radius of 0.484". The actual radius of 
this circle is 0.485", corresponding to an Area of 0.739 in^. 

'Th* radius of the circle representing manpower in tha par itest is still 0.5" 
(ar^a of 0.7&5 in^), but the radius of the second circle is now 0.479 in pro- 
<lucin<j an ir^i of 0.721 iir z , less than tho require! 0.730 in<\ 
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becomes important, for example, if one wishes to represent, say, ordinal infor- 
mation concerning population densities of various regions by different dot 
densities* Taves (1941) established a differential sensitivity index for dot 
nuoerosity of 0*204 und»r nonsimultaneous viewing conditions* This means that 
if population density of one region is represented by 10, ,cs, then the den- 
sity of another region, greater than the first, must be represented by 120 dots 
to be perceived as just greater* A third region relative to the second should 
contain 145 dots if it is to be perceived as just different* 
Discrimination of Color J 

Colors nay differ in their hue, brightness, and saturation* 

Hue 

"Hue" is the term referring to the dimension that separates red from 
green, and so on* Hue is a psychological property, existing in the eye of the 
beholder* Different hues are produced primarily by differences in the wave- 
length of light (measured in nanometers, or m\x) * Figure 3*11 shows the varia- 
tion in hue as a function of wavelength along the spectrum, from red through 
orange, yelLov, green, and blue to violet* 

INSERT FIGURE 3*11 HERE 

Our ability to detect differences in hue is not uniform for equal changes 
along the physical spectrum* Figure 3*12 shows mean JND's (AX) and standard 
deviations of hue as <* function of wavelength (A) from 410 m^ through 630 m^ 
(obtained from a set of experiments by Siegel and Dimmick, 1962, and Siegel, 
1964)* (p^rcall that the smaller the JMD, the greater the sensitivity*) The 
figure shows that peak sensitivity to hue difference is greatest in th« ranges 
of about 450 to 430 mu, corresponding to the yellow region* For this region, a 
change in spectral composition of less than 1*0 mu is needed to be perceived as 
"just riiff^ront"* Sensitivity to hue differences i^ weakent at the extremes of 
the spectrum (corresponding to tho violet and reJ regions) and also for the 
qr^en roqion at about 520 mji* 
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We can also see from the figure that the wavelength discrimination func- 
tion does not at all resemble the function described by Weber's Lav (which 
asserts the JtlO increases linearly with stimulus level)* One possible reason 
for this is that Ueber's law states that the amount of stimulus magnitude that 
must be added for a JND to be sensed must be proportional to the existing 
level* That is, discrimination satisfying Weber's Law are mediated by additive 
perceptual dimensions, such as loudness, whereas color is a substi tu te dimen- 
sion* In other words, increasing the wavelength of a patch of light does not 
lead to the perception of more of something, it leads to the percep .on of a 
different something* 7 And, not surprisingly, it has been found that displays 
that use a gradual shift from one color to another to represent a continuous 
variable are difficult to understand; we see such variation as a qualitative 
change rather than as a quantitative gradation (Wainer £ Francolini, 1980; 
Wainer, 1981 ) * 

INSERT FIGURE 3*12 HERE 
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These differences mirror differences in the neural substrate of sensation* 
For additive dimensions like lightness, the magnitude of the stimulus increas- 
es, the firing rates of neurons already responding to stimulaticn increase* 
it Bay well be that if this increased firing rate is sufficient, then it 
r.esults in a JTJD being experienced* If this phenomenon is not sufficient, 
then additional neurons are recruited and their added effects evenutally 
result in a JHD* 

Hue, on the other hand, is experienced as an attribute of quality in 
which discrimination is mediated by substitutive processes, that is, which 
n&urons are firing, not simply how many are firing or how frequently they re 
firing* There are four types of "spectrally opponent" cells responsible for 
color visio/i (See Devaloi^, 1975 for a good discussion*)* Briefly, these 
types are termed red-excitatory , green-inhibitory (+R-G), and yellow- 
e xcitatory , blue-JLnhibi tory (+Y-B) and the mirror image of these {+B- 
Yi) . Eacn of these types is spectrally turned to a particular range of wave- 
length, that is, for certain wavelengths, each responds in an excitatory man 
ner whilf* being inhibited for ither wavelengths* As one progresses across 
the sPectrm, there is no ade ,ional recruitment of neuro'ns, but instead, a 
substitution of excitation ol one cell for another* Therefore, the sensi- 
tivity <*i whatever cell is firing in response to the stimulus dictates how 
d ff'jr^ncos in wavelength are detected* It is interesting to note that the 
two minima of th** wavelength discrimination function occur in the same spec- 
tral regions as the "crosspomts" of .the two pairs of spectrally opponent 
^eils, tt>e fR-G and KJ-R cells having their crosi>point^ at approximately 590 
nf,<, and ttte + B-Y and *Y-B at approximately 500 mU* If discriminations are 
based on which cell i^ firing, then a double minimum' in the wa/el^ngth 
discrimination function is exactly what we would e>:pect* 
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When using color for coding nominal information in graphic displays* Table 
3*5 recoirjaends certain hues (coded in the Munsell classification) when fewer 
than nine colors are needed, which we recommend due to our limited memory capa- 
cities* The hue* in this table are maximally & l scriminable from one another* 

INSERT TABLE 3*5 "h.^<E 

Saturation 

If things have the same hue, it is still possible to detect a difference 
between them because of differences in saturation* Saturation can be thought 
of as the degree to which a color appears to be rich and pure, fr^e of white- 
ness, grayness, or blackness* For example, red differs from grayish red in 
saturation* If light consists entirely of a single wavelength, (say 53Gm*i, 
which corresponds to yellow) it is said to be completely homogeneous or mono- 
chromatic and has a "colorimetric purity" of 1* White light, on the other 
hand, is a mixture of all wavelengths, or "maximally heterogeneous" and has 
purity 0. Between these two extremes exis^ graduations in purity* if color- 
metric purity changes with the luminance h^ld constant the color seems to 
change principally in grayness* That is, as purity increases, grayness 
decreases* Colorrnetric purity/ then, is specified as the ratio of monochroma- 
tic light in a mixture of monochromatic and achromatic light* 

Studies investigating the maximum perceived saturation of various hues 
(Jones, and Lowry, 1926/ Priest and Brickweddle, 1938) have shown that satura- 
tion appears greatest at the extreme wavelengths and decreases to a minimum at 
about 570 mu* Thus, red and blue light will always appear more saturated than 
yellow litjht of the same colorimetric purity (i*e*, proportion of the light is 
composed of wavelengths of that hue*) 

Experiments by Paneck and Stevens (19oM and Inflow and Stevens (1906) havr? 
est'Juhed differential sensitivities for saturation of both primry and in- 
termediate hues* For saturation of red, a primary nu-;, Paneck and Steven* 
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found that a 2% change in purity is necessary for a just noticeable difference 
in saturation* Differential sensitivities tor changes for hues frcm 550 to 530 
mp (from a greenish yellow to yellow) and for hues from 630 to 583 mM (red^ to 
yellowish green) were investigated by Indow and Stevens (1966), who found Weber 
fractions in the range of 2%. Th. for example, if the purity of color at 
some dominant wavelength is, say, '80 percent, then purity at the same wave- 
length mU st be greater than 81*6 percent, if a noticeable difference in satur- 
ation is to be observed* 

Because saturation, unlike hue, is perceived as a continuously varying 
quantity, it is better to use variations of saturation (e.g., between white and 
richly colored, with pale as an intermediate) then hue in displays like maps 
where some variable must be plotted as a function of location (wainer S 
Francolini, 1980). 
Brightness 

Colors also differ in their brightness. Brightness discrimination invol- 
ves the ability to detect changes in luminance along the achromatic scale, 
black-to-gray-to-white. Lowry ( 1 931 ) has shown that for maximum discrimination 
to occur, the luminance of the field should be between 20 and 30 ml. Under 
this condition, the differential sensitivity is 1.4^. At lower luminance 
levels, discrimination decreases markedly. 
Shape Discrimination 

Discrimination of shape is a very complex phenomenon involving sensory, 
perceptual, and cognitive processes and interactions among these processes* At 
the higher levels of processing, shape discrimination comes under the headings 
of form perception or pattern recognition. We will consider here some investi- 
gations of our abilities to distinguish changes in relatively simple shapes as 
certain aspects of the shapes are varied. This may be important for the design 
of charts and graphs in which the shapes of a set of symbols vary continuously 
(s*<J*, from a horizontally-oriented ellipse, through a circle, to a vertically- 
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oriented ellipse) to signal values along some continuum* Similarly, there may 
be displays in which squares represent one entity, and rectangles represent 
another entity, so the two shapes must be discriminate if the display is to be 
unambiguous* In addition, dynamic inforrr* -ion su^h as, for example, an air- 
craft's glide angle is generally coded as ^ome s. ^ge (e*g*, a diamond shape) on 
a cathode ray tube* As the slope changes, the shape changes in its form some- 
what* Thus, to maintain a proper glide slope, the ability to recognize changes 
in shapes and discrimination is clearly important* 

Veniar (1948) examined subjects' ability to distinguish between a square 
and a rectangle oriented horizontally or vertically* She established a differ- 
ential sensitivity of 1*37 percent for shape distortion when eith^ the hori- 
zontal or vertical sides of the square were distorted* This value implies that 
if a 10 cm* X 10 cm.^quare is projected, a 10*14 cm* X 10 cm* rectangle will 
be perceived as just different* Veniar also investigated the effects of stimu- 
lus area and illumination or* discriminability and found no influence of these 
variables for the ranges considered* Note that this value of differential 
sensitivity is different from that found for "pure" length discrimination (See 
Table 3*5), suggesting that di:.^rent processes may be inv \ved in the two 
types of discrimination* In fact, in debriefings following the experiment, 
subjects reported that their judgments involved the shape as a whole* and not 
the individual line lengths* 

In another shape distortion experiment, Kelly and Bliss (1971) investi- 
gated sensitivity to distortions of diamond -shaped figures* Distortion was 
indicated in terms of diamonds appearing "taller" or "shorter 1 * than a standard 
defined as having a height/width ratio of 1*000* Consequently, diamonds 
appearing taller and had height/width ratios greater than 1*00 while thoso 
appearing smaller had ratios less than 1*00* Kelly and Bliss found a differen- 
tial sensitiv\ty of 4*8 percent, corresponding to height/width ratios of 
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1*048/1*000 and 0*952/1*000 for just taller and just shorter diamonds, respec- 
ti ve ly * 

2* The Principle of Perceptual Distortion 

Everyone kno'^s that things ar* not always as they appear* But most people 
seen to think this is largely due to the occasional optical sion. He _>r, 
in many cases there is no illusion but the perceptual system nevertheless is 
systematically distorting the relationship between the magnitude of the sensa- 
tion we feel and the value of the physical stimulus property which excited the 
sensation* 
Optical illusions 

Any introductory textbook on perception devotes considerable space to a 
discussion of illusions (e*^*, see Haber and Hershenson, 1982)* A number or 
illusions have been found to affect graph reading per se* For example, 
Cleveland (1982) found that color on a statistical map can cause an illusion: 
when colors were highly saturated, a red area was seen as larger than an equal- 
ized green area; when the colors were not highly saturated, however, no illu- 
sion occurred* Another illusion discovered by Cleveland, Diaconis, and McGill 
0 932J is directly relevant to one of the most common display types ; simple 
scatterplots in which points are plotted within a set of coordinates* 



Cleveland et* al* asked subjects to judge the degree of "linear association" 
between the two variables plotted* all subjects had some statistical training 
9 and understood the instructions. Judgements were made using a 100 point scal^, 

with 0 being equivalent to r=0 and 100 being equivalent to r»1* When the scale 
was reduced on the frame, so that the "point-cloud" was reduced in size, sub" 
) jects saw a higher degree association* This shoulc" be evident in Fiyure 3*12* 

INSERT FIGURE 3*12 HE1U' 
An assortment of other illusions may be relevant to special kinds of 
> graphic displays (see Wainer & Thissen, 1981)* For example, the "top hat illu- 

sion" results in our seeing vertical lines as longer than horizontal lines of 
the same length* in some exotic plots line-length and orientation can be used 
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to represent information so some lines may be vertical and others horizontal — 
in vhich case this illusion would be a source of mis-information* In gee i ral, 
however/ most optical illusions usually discussed in perception texts are not 
likely to affect charts and graphs* 
Systematic Distortions; The Po Law 

The relationship between the physical magnitude and the psychological 
magnitude can be expressed by the following formula/ due largely to the work of 
S*S* Stevens* 

¥ = k* b (4) 
In the equation above/ ¥ is the subjective magnitude of the sensation/ $ is the 
physical magnitude of the stimulus itself; and b is an exponent (to be deter- 
mined from empirical data) which characterizes a particular sensory modality (k 
is simply a constant which relates the units of sensation to those of the phys- 
ical stimulus property)* In other words/ for any perceptual continuum, the 
perceived magnitude of a stimulus is some power function of the stimulus *s 
physical magnitude, with the exact power in the function varying from continuum 
to continuum* Steven's law is often called the "Power Law" for this reason* 

Because the power or exponent in the power function (b) is not necessarily 
equal to 1*0, sensations often do not change in direct proportion to changes in 
the physical stimulus* This has some important implications for reading charts 
and graphs* In the remainder of this section we will discuss the various con-** 
sequences of the power law for chart and graph comprehension* We shall provide 
estimates of exponent values for the visual continua commonly employed in pro- 
ducing charts and graphs* Bas^d on these estimates and other research results, 
we shali make recommendations on how best to use thsse continua in visual dis- 
plays* In many instances, there is considerable variation in estimates 
exponents, due to differences in research methodology, in which cases we pro- 
vide tho range of values* 



97 



ERLC 



General Consequence of the Power Law 

The form of equation 4 has several mathematical properties reflecting 
properties of perceptual systems that may have enhanced the species's chances 
for survival in a natural environment* First, the power law provides for ~atio 
invariance* That is, equal stimulus ratios induce equal sensation ratios As 
a consequence, an object in the environment appears to retain a constant size 
and shape in relation to background objects as its position changes relative to 
the observer* Second, for some sense modalities, such as visual brightness, 
the natural environment may present a broad range of values (up to 10 orders of 
magnitude)* jf the visual system wer^ to transduce and process brightness in- 
formation linearly (i*e*, the exponent were t), the system would have to be 
much larger to register the entire continuum and probably would have to possess 
a greater neuronal mass* In fact, however, the psychophysical exponent for 
brightness is less than one* This enables the same range of physical bright- 
ness to be registered within a smaller sensory system* The information that is 
lost by virtue of the nonlinear sensory transformation has little importance 
for survival* 

These two advantages provide an explanation of why the evolutionary pro- 
cess has favored a power function for sensory encoding* in terms of graphical 
applications, however, the consequences may not be so happy; the power law can 
distort the presentation of ft information when the continuum that is being used 
ha* an exponent greater than or less than 1* 

The perceptual distortion that occurs as a result of the psychophysical 
power law dan best be explained using a graph* Figure 3* 13a shows a power 
function with an exponent of *7 (i*e*, the physical magnitude is raised to that 
power to predict the corresponding subjective magnitude)* As a consequence of 
the exponent being less than one the value of the sensation, f, increases less 
rapidly than that of the stimulus, ^* Figure 3* 13b shows a graph of a power 
function where the exponent, b, is 1*2* In this case, the value of the sensa- 
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tiori/ ty, increases more rapidly than that of the stimulus property. Finally/ 
Figure 3.13c shows a plot of the power function whose exponent is 1 .0. This 
function is represented as a straight line. In this and only this case no 
perceptual distortion occurs/ with sensation increasing at the sane rate as the 
stimulus magnitude. Th , ?tf en the exponent is equal to 1.0 do things 
differ in the way they appear to differ. 



The foregoing characteristics of the sensory power law have important 
implications for graph construction and comprehension. First/ equal intervals 
on a stimulus continuum do not/ in general/ correspond to equal intervals on 
the subjective continuum. This is made clear in Figures 3.13a and 3.13b where 
w A and are equal intervals on the physical continuum The corresponding 
subjective intervals Wj and are obviously not equal (W^W^). Equal physical 
intervals correspond to equal subjective intervals only for, sensory dimensions 
whose power functions have an exponent of 1.0. In the construction of graphs/ 
one is often concerned with conveying a relation between two quantities by 
using a corresponding visual relation between graphic symbols representing 
those qualities, when the information to be conveyed is interval scaled, the 
graph maker should be aware that equal intervals on the scale of interest may 
not portray equal subjective intervals on the particular visual continuum. 

For example, suppose we wish to construct a chart that provides informa- 
tion on various occupations. Each occupation xs to be represented by a circu- 
lar area of uniform size and the mean intelligence quotient (I.Q. ) for people 
in each occupation is to be indicated by the apparent lightness of a particular 
red hue used to coio** each circle. Consider three occupations/ A, B, and C, 
with mean I.Q.'s of 130/ tiO/ and 90, respectively, which are represented with 
a (S22.nm red color with a coiorimetric purity of .51. Table 3.6 provides a 
list of the exponents for perceived lightness as a function of the 
corresponding physical stimulus property/ reflectance/ considered separately 
ror various 



INSERT FIGURE 3.13 HERE 




colorimetric purities for different hues. Assuming that the red graph is to be 
read under artificial light, an exponent of .62 is appropriate. Using an 
arbitrary constant (k=i.O) for the formula (in equation 4), lightness is 
plotted against reflectance in Figure 3.14. 

INSERT TABLE 3. 6 P :GURE 3.14 HERE 
Because the three occupations are spaced at equal (20 point) intervals we 
must choose our reflectance such that the intervals in subjective lightness are + 
also equal. The symbols on the chart can be made .most discriminable if the 
full range of available reflectances *re used. Similarly, the meaning of the 
chart is made most transparent if lighter' 3 hades of red are assigned to occupa- 
tion groups with the brighter people (for reasons to be discussed later). 
Thus, we select a value of .3 on the subjective lightness scale for occupation 
A, a value of .2 for occupation B, and .1 for occupation c. The positions 
marked 1^, Ig, and 1^ on Figure 3.14 indicate the corresponding points on tne 
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psychophysical function. From this it can be seen that reflectance values of 
.170, .094, and .034 must be employed for occupations A, B, and C, respec- 
tively. 

Although this set of ^values satisfies the requirement that equal intervals 
in the referent scale tl.Q.) are represented as equal intervals on the subjec- 
tive lightness scale, it is certainly not unique in this respect. We might 
have selected a subjective lightness value g£ »29 for occupation A and used .9 
unit decrements on the lightness scale for the other occupations. The graph 
maker has considerable latitude in following this procedure to remove distor- 
tion in interval scaled graphs. In addition, the level of precision discussed 
here may exceed that needed for most uses of most charts and graphs. But in 
all cases, the graph maker should be aware that equal differences in physical 
units may not be seen as equal and hence may not function to communicate effec- 
tively . 
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Of course# in some cases, adjusting the elements of the graph to compen- 
sate for the distortions of the human perceptual system may be the wrong thing 
to do* For example, ^or elements ?uch as squares with different areas, the 
reader may want actually to measure the elements to obtain absolute value, to 
interpolate, or to verify that the graph really suppc*" the claims made in the 
accompanying text* In such a case, altering the areas so that they differ from 
the exact quantitative values dictated by the information being communicated 
would have disastrous results* Note that simply by choosing a continuum with 
an exponent of 1*0 to begin with to represent the quantities, the graph maker 
thereby avoids both perceptual and physical distortions* 

The second majcr'implication of the power law for chart and graph con- 
struction concerns the property of ratio invariance, which was mentioned earl** 
ier* Recall that "ratio Invariance" means that if the ratio between the values 
of two stimuli is equal to the ratio between the values of two other stimuli, 
then the ratios of the corresponding pairs of sensations are also equal*** A 



9 This can be demonstrated quite succinctly by the following equations . 

Let $2* ^3 an< ^ \ represent values on a physical continuum for four sti- 
muli, and let {>j, ^ and ^ represent the sensations which result from 
these stimulus values. Then ifj 

_ = _ (a) 

$2 

— = „ (b) 

This is most; easily seen by employing the power law, ^ = k^ to rewrite / 
equation (b) in terms of equation stimulus properties. 



i t follows that* 



b b 
k<^ k^ 

Raising boths sides of equation (c) to the (l/b)th power yields equation (a), 
as promised, 

4>l $3 



(d) 
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particular ratio of stimulus properties will not/ in general/ yield the same 
ratio of sensation magnitudes* Given a stimulus ratio, resulting 
sensation ratio, i^i/^' *- s Thus, ratios of stimulus properties 

are also transformed by the power law* A given stimulus ratio can produce the 
same sensation ratio only when the sensory system invo^ ■ is characterized 
an exponent of 1*0* 

Let us suppose, vhen, that we wish to convey the idea that country A has jc 
times the population of country B by sketching outline maps of the two coun- 
tries, where the apparent sizes of the drawings reflect the corresponding popu- 

L 

lation sizes* In order to accomplish this graphically/ we would employ the 
exponent b in the psychophysical relationship between the physical property/ 
area, and its subjective correlate/ apparent size, as follows: 

x = V or x 1/b =*A (5) 

B r B 

If we select an appropriate area, for country A, then equation (5) can 

be used to determine $ B such that the ratio of apparent sizes is x* 

*B = V U)1/b {6) 
The non-linearity of the psychophysical power law, therefore/ has implications 
on the use of graphic symbols to portray relationships between symbol refer- 
rents on intervals and ratio scales* Knowledge of the values of the power law 
exponents for the various visual continua/ together with an understanding of 
their role in perception, is important for accurate graphic communication* 

The power law does not only bring bad news to the graph maker/ however* 
Simple (i*e*, nonpolynomial) power functions are mono toni c: they either in- 
crease or decrease along their, domains/ but never both* That means that ordi- 

relationships (e*g*, A is larger than B, but not by any particular amount 
or ratio) will virtually always be perceived veridically if conveyed by some 
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physical continuum* If all the graph maker wishes to convey is the ordinal 
relationships among a set ol entities, and not their exact differences or 
ratios,, the power^ law does not entail any problems* 

A few words of caution are in order before proceeding with a discussion of 
the systematic distortions to be found in the visual continua that are impor- 
tant to chart and graph construction* In many cases there is considerable 
variation in the exponents that were determined empirically for a given contin- 
uum by different investigators* At first glance this variation would appear to 
be random* However, the results obtained from a given type of measurement 
procedure are sometimes known to be consistently higher or lower than those 
from another procedure* Because much of the variation iu the tabulated expo- 
nents is, in this sense, systematic, the reader should view the results to 
follow with reduced skepticism* 

Most of the estimates we provide were obtained either with an "estimation 
procedure" or with a "production procedure"* In the estimation procedures, a 
subject is first shown a standard stimulus to which the experimenter assigns a 
number, and the subject is then asked to respond to each of a series of experi- 
mental stimuli by saying a number which reflects the subjective or perceived 
relationship of the experimental stimulus to the standard stimulus* In the 
production procedures, the experimenter presents the standard, and gives the 
subject the means to vary stimuli (e*g*, by turning a knob controlling the 
brightness of a light) to yield a given relation to the standard (e*g*, five 
times as bright)* In general, estimation procedures yield lower exponents than 
production procedures* The availability of the standard stimulus and its mag- 
nitude in relation to th** experimental stimuli also have systematic effects on 
the exponenc later computed* 

With this in mind, let us now specifically examine the visual continua 
which are most often used in graphic representation* 
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Line Length and Inclination 

The relationship between physical line length and its subjective corre- 
late/ apparent line length, has been studied thoroughly by a number of invest! 
gators* Table 3*7 presents the results of some of these studies, as well as 
some of experimental conditions and methods employed, 1 all cases the 
exponent was found to be close to_1*G, thus indicating a linear relationship 
between the physical and subjective continua* Although the veridicality of 
length perception is not surprising, it is, nonetheless, a fortuitous result 
pecause it obviates the difficulties of employing nonlinear transformations in 
order to encode information so that it is'perceived accurately* 

INSERT TABLE 3*7 HERE 

In light of this finding it is easy to understand why bar graphs are so 
pervasive as a graphic format* Information in a bar graph is encoded directly 
by the lengths of a set of discrete lines or bars* Because perception of 
ratios is veridical in this case, the information can be assimilated without 
resort to any mental or graphic gymnastics* 

Stevens and Galanter (1957) found that the" relation between angular orien 
tation of a line and subjective inclination is also linear* And as before, * 
Miller and Sheldon (1969) obtained a linear relation between the average in- 
clination of a group of six lines of varying orientations and the subjective 
average as perceived by their subjects* 

Thus, we may conclude that straight lines are well behaved in a psycholog- 
ical sense* The relation between physical attributes, such as length and ori- 
entation, and the subjective correlates of these attributes is linear, even 
when average quantities describing groups of lines are at issue* 
Area 

Unfortunately, the simple relation that obtains for the actual and per- 
ceived length of lines is lost when lines enclose areas* The, relationship 
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between physical area and apparent size of various two dimensional figures has 
been studied extensively by numerous investigators* The results of some of 
these studies* organized by the shape of the figure, are presented in Table 3,8* 

INSERT TABLE 3.8 HERE 

Initial petusal c this table leaves one befuddled by the wide range of 
exponent estimates (from *55 to 1*20)* However, much of this variation can be 
attributed to identifiable sources, many of which have implications for graph 
construction* One important influence on the exponent value is the instruc- 
tions given to subjects prior to the experimental task* Teghtsoonian (19^5) 
asked half of her subjects to "estimate the apparent sizes" of a set of cir- 
cles* The othei: half were asked to "base their judgements on the actual physi- 
cal areas"* The exponent resulting from apparent size instructions was *76, 
whereas that resulting from physical size instruction* was 1*03 (see also 
Macmillan et* al*,°'(1974)f Teghtsooni^n, 1965)* 

These results suggest some guidelines for the graph maker* Under certain 
circumstances a person's perception of -the ratio of two physical areas is 
nearly linearly related to the actual ratio of physical areas* These circum- 
stances are* 1) the person is specifically asked to attend to physical area; 
2) thti person understands the concept of area and how it is calculated; 3) the 
stimuli are similar in shape and the shapes possess enough linear cues to 
enable an accurate area estimate* Under these conditions/ the exponent ranges 
from * /5 tc 1*0* In contrast, if a person is instructed to (or will spontane- 
cusly) attend to apparent size, or the shapes are nonsimilar and irregular, or 
the subject is mathematically naive, exponents may be substantially lower* A 
person who' interprets apparent size to be indicated by a prominent linear 
dimension will operate with an exponent near *5; however, most people will 
operate with exponents between .7 cad *8* — resulting in reader's systematic- 
ally under-estimating differences among increasing areas* 
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Thus, given the incompleteness of present theories of how we compare the 
areas of differently-shaped figures* the graph maker is advised to avoid using 
different shaped figures when precise interval area relationships are to be 
communicated* Sadly, this could apply to the currently popular maps in which 
the magnitude of some attribute of a ^uitry (e*g*, oil reserv is conveyed 
by the si2e of the country on the "map* 

Volume as Implied by Perspective Drawings and Volume in Real Space 

Perspective drawings of solid objects are frequently used as symbols in 
charts and graphs* The use of these drawings has been of particular interest 
to cartographers who use symbols to convey simultaneously information about a 
location and some other attribute/ such as population, of cities on maps* For 
them, the use of simple area symbols for cities becomes unwieldy because heavi- 
ly populated cities require inordinately large areas which would imprecisely 
mark the location of the city and possibly obscure smaller cities in the vicin- 
ity* Implied volume is one way of overcoming the problem* For example, per- 
spective drawings of two cubes, one with 1 mm edges and one with 10 mm edges, 
imply a volume ratio of 1000:1* On the other hand, simple squares with 1 mm 
and 10 mm sides imply an area ratio of only 100:1* The availability of this 
theoretical advantage is of course contingent on readers having the ability to 
estimate ratios of volumes in real space accurately/ and on their perceiving A 
perspective drawing of a solid as they would an actual solid* Therefore, the 
exponent for volume in real space should be close to 1 (at least when subjects 
are asked to attend to actual physical volume rather than apparent size) and 
the exponents for actual solids and perspective drawings of solids should be 
nearly equal* 
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Exponents for the psychophysical relation be*.^een physical and subjective 
volume for real solids have been estimated by Ekman and Junge (1961) and by 
Teghtsoonian (1965)* These are presented in Table 3*9* Exponents for the 
relation between physical and subjective volume as 5 mplied by perspective draw- 
ings have been estimated by EJcman and Junge (I960), H961) and by Ekman, 
Landman, and William-Olson (1961)* These are presented In Table 3*10* 

INSERT TABLES 3*9 AND 3*10 HERE 
Ifte conclusion to be drawn from experiments on drawings of cubes 
(obviously of more interest to the graph maker than real cubes), then, is that 
most people compare small perspective drawings of three dimensional objects on 
the basis of the area enclosed by the drawing and not by the actual volume 
implied* The graph maker, therefore, should not attempt to employ perspective 
drawings with tha expectation that readers will perceive differences in volume 
veridically* 

Proportion and Mumerousity 

The concept of proportion is of ten 'conveyed graphically by a pie chart* 
Radii at various inclinations divide a circle into segments, and proportion 
information is encoded primarily by the relative areas of the segments* 
Because the perceived inclination of a line is linear in relation to the actual 
inclination, as previously discussed, the pie chart should be effective in 
conveying proportion information* The display format of the pie chart is very 
rigidly structured, however, and not conducive to conveying information in 
addition to proportions* If, for instance/ the chart is to be used to show the 
proportions of different ethnic groups in the U*S*, all members of each group 
must be gathered up, in a sense, and put in the appropriate segment* Informa- 
tion about the proportion and the geographic distribution of group members 
cannot be conveyed simultaneously* 

Studies have been reported by Stevens and Galanter (1957) and by Rule 
(1963) on a subject's ability to estimate proportions in a less structured 
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format* Although these investigators were not specifically concerned with 
graph and chart comprehension, their results can certainly be applied to this 
issue* In the Stevens and Galanter study the stimuli were blue and green dots 
placed randomly in an 8 cm square* The total number of dots was 36 in all 
cases but the proportion of blue to green was altered for the var,^ . .imuli* 
When subjects were asked to estimate the percentage of blue and green dots, 
estimates were most accurate at the two ends and at the center of the stimulus 
range* However, the subjects' percentage estimates increased linearly with 
increases in the actual proportions* In a similar experiment, Rule asked sub- 
jects for magnitude estimates of the proportion of dots and lines occupying the 
positions of an eight by ten rectangular array and obtained an exponent of 
*97* 

Numerousity, as noted earlier, refers to one's subjective impression of the 
number of elements in some collection gauged without counting these elements 
one-by-one* This continuum differs from that of proportion in that numerous- 
ity is concerned with elements of one type, the number of which is neither 
confine to a specific range nor considered in relation to the number of some 
other type of elements* Estimates of the exponent for the perception of numer- 
ousity range from *65 (Taves, 1941) to 1*34 (Stevens, 1957), reflecting differ- 
ences in methodologies* Krueger (1972), examining the perception of numerous- 
ity and how it is affected by display si2e, offered an exponent of *85 as his 
best estimate for a true exponent for numerousity, averaging across estimation 
(*72-*78) and production (*93) methods* The exponent of *85 for numerosity 
indicates that subjects typically under-es timate the number of items preset* 
Items displayed in a compact area tend to be underestimated more than those 
shown in a large format (owing mainly to a difference in the proportionality 
constant, k (e*g*, 4), not the exponent), but the effect of the size of the 
format seems co diminish (or saturate) at some point* The exponents for pro- 
portion and numerousity are presented in Table 3*11. 
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INSERT TABLE 3*11 HERE 

Lightness and Saturation 

In many instances, differences in color attributes or lightness of gray 
tones serve as a basis for differentia* 'ng chart and graph symbols* we# there- 
fore, include a brief discussion of the power law exponents of various chroma- 
tic and achromatic attributes of visual stimuli* 

In order to minimize ambiguity in this presentation* recall our earlier 
use of sorae relevant tevms* Hue is the attribute of a color "perception denoted 
by the names blue* green, yellow* red, purple, etc* An achromia tic color per- 
ception is one which possesses no hue (e*g*# white* gray* and black)* Satura - 
tion is the attribute of color perception determining the degree of difference 
from the achromatic color most resembling it* Brightness (of an area perceived 
as self luminous, such as a computer video display) is the perceptual dimension 
ranging from very dim to very bright or dazzling* Lightness (of an object 
perceived as non-self luminous, such as a price of paper) is the perceptual 
dimension ranging from dark (black, for achromatic stimuli)/ to light (white, 
for achromatic stimuli)* Recall that each of these subjective continua is 
associated primarily with a physical continuum* Hue is chiefly associated with 
wavelength, saturation with colorimetric purity, brightness with luminance/ and 
lightness with the luminance factor (percent of incident light of what the 
surface reflects back)* More conplex relations are also operative in color 
vision; for instance, hue is affected somewhat by' purity and luminance/ and 
brightness is affected somewhat by wavelength arid purity* 

Guirao and de Mattiello (T974), using non-self luminous surfaces/ obtained 
exponents reproduced in Table 3*12* Note that the exponents for small-sized 
fields are greater than those for the large fields/ regardless of the type of 
illumination, for all hues except yellow* Also/ blue, green and red have lower 
exponents vhen viewed under daylight conditions than under artificial light* 
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Yellow and orange appear to be unaffected by the type of illumination* Self 
luminous colors, such as those which appear on a computer graphics screen, are 
characterized by lower exponents than the surface colors (see Indow & Stevens, 
1966), although these exponents generally are over 1*0* 

INSERT TABLE 3*12 HERE 
Xn another scudy, oe Mattiello and Guirao (1974) examined the relation 
between lightness, luminance factor (% reflectance), and colorimetric purity* 
The exponents they obtained for lightness as a power function of percent 
reflectance at a given colorimetric purity are presented in Table 3*13* 

Although no studies have been performed to determine whether saturation 
ex/onents change continuously with the size of the color patch, the graph maker 
should be aware that saturation may be affected by the size of a colored 
figure, with greater exponents for smaller areas* It is almost as if the same 
color placed in a smaller area appears "denser" and hence, r ->re saturated* 
Thus, slight differences in colorimetric purity may be required to make two 
figures of the same hue but different sizes appear equal in saturation* 

INSERT TABLE 3*13 HERE 

Conclusion 

In closing this section, it is worth noting that the foregoing principles 
allow us to explain some data collected on graph reading per se* Croxton and 
Stein (1932) examined the ability of bar graphs to convey the relative mag- 
nitude of two quantities* They compared the accuracy of subject's estimates of 
the ratio of: (1) one bar length to another, (2) one square area to another, 
(3) one circle area to another, and (4) one cube volume (as depicted in a line 
drawing) to another* They found that accuracy of the estimates decreased with 
increasing number of dimensions; bars were more accurately compared than 
squares, circles or cubes; squares and circles were more accurately compared 
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than cubes; and squares and circles were equally well compared* These results 
are not surprising* Given the fact line Length is known to be a linear fun^ 
ction of physical Line length but perceived area and perceived volume are non- 
linear power functions of physical area and volume respectively, with the 
exponent of ^ . voLume power function deviating further from 1*C tnan that of: 
the area power function* Croxton and Stein also compared the accuracy of esti- 
mates of the relative arcs of pairs of circles and squares when the centers of 
the paired figures were horizontally aligned versus when the bases of the fig- 
ures were horizontally aligned* Wo difference was found, as we would expect 
from the foregoing discussion (see MacDonald-Ross, 1977, for an extensive 
review of this literature)* 

Thus, the material reviewed in the preceding sections chapter can serve as 
a substitute for a vast number of potential experiments on the accuracy of 
reading graphs *rith different sorts of physical marks* Where the Weber frac- 
tion for a sensory continuum is small and the exponent is close to 1*0, ^e can 
expect more accurate interpretation than we vould for other continua* Even 
better, we can predict the types of errors that will be made, the sorts of 
adjustments that can eliminate the errors and how the type and extent of dis- 
tortion varies with extraneous factors such as size and illumination* 
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II* Natural Units 

1* Gestalt Principles 

Xf a display is to be read accurately, the Marks must be read and organ- 
ized correct^ so far we nave b*en concerned with factors that must be con" 
sidered if the marks themselves . e to be read correctly, and now we turn to 
factors that underlie how we organize marks into psychological units* As we 
shall see when we turn to the graph comprehension model outlined in Chapter 6, 
how the visual system parses the visual input into units and links these units 
together has important consequences for how easily the various parts of the 
graph are recognized and how easily the appropriate quantitative and conceptual- 
information will be extracted from it* This section will review briefly the 
progress made in the study of perceptual organization and consider the implica- 
tions for graph construction* 

The Gestalt psychologists began work in the 1930's that has led to some 
genuine understanding of how visual stimuli are organized perceptually* The 
Gestalt psychologists believed that visual stimulation initiates the action of 
organizing electrical forces in the nervous system, which separate figures from 
their backgrounds, establish distinct groups of objects, and define structure 
in the visual scene* Although many of the physiological models postuiated by 
the Gestalt psychologists have since been proved incorrect and many of their 
explanations of organizational processes have been found inadequate, some key 
features of their approach to visual organization continue to be of value in 
the study of perception* Many researchers now believe that the Gestalt Princi- 
ples reflect the operation of mechanisms that seek to carve the continuous 
cotic array into distinct portions, each of which corresponds to a physical 
object in the world* Although charts and graphs are not typical visual stimuli 
found in the world, we can expect that these same mechanisms will go to work on 
graphic stimuli and attempt to discern "objects 11 and their interrelationships 
in thsm* 
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Max wertheimer (1938) formulated a set of "laws" (or principles) of 
organization of visual forms involving the following factors: proximity, simi- 
larity, continuity, closedness, and symmetry* Several of these factors may be 
operative in the same scene* In some cases, rll factors may be cooperating to 
emphasize a common structure* in other case:* factors may be set in opposition 
bo each other, with each promoting a different structure* On such occasions, 
one of the alternate structures usually emerges as dominant, although weakened 
by the conflict* The major shortcoping of wertheimer's principles for our 
purposes becomes apparent here* That is, when two factors are in conflict, 
Gestalt Theory cannot predict which will emerge victorious* In the ensuing 
discussions of each organizational factor, we shall provide examples of cooper- 
ation and conflict to illustrate the operation of these principles in charts 
and graphs* 
Proximity 

Figures that are situated near each other tend to be associated with each 
other Lu foLV. z common structure* This is demonstrated quite clearly in the 
examples shown in Figures 3*14a and b* The spacing between the circles of 
Figure 3*14a ii .uces the observer to group pairs of circles to9ether in the 
pattern 12/34/56/etc* In Figure 3*14b the observer is lead to group the 
squares into triplets; 1 23/456/789/etc* in these simple examples the influ- 
ence of proximity is so strong as to preclude alternative groupings such as 
1/23/45/67 in Figure 3*14a* 

INSERT FIGURES 3*14 AMD 3*15 HERE 

Grouping by proximity is easily studied because it is one of the few 
Gestalt principles where it is possible to obtain quantitative measures of the 
stimulus properties* For example, Kohler and Adams (1961) used an array simi- 
lar to that shown in Figure 3*15, but varied the ratio of spacing between rows 
and columns (from 1*0, equal distances, to *25, where inter-row space is 4 
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times inter-column space)* if subjects were not directly attending to the 
display/ the stimulus ratio necessary to induce thera to report row or column 
organisation was about .38* If subjects were directly attending to the dis- 
play, however, and they were looking for row or column organization, a ratio of 
about .62 was required for articulation* In other wo*, =>, whet. . person active- 
ly looks for a particular sort of organization, the stimuli need not be physic- 
ally separated to as great an extent as when a person has no prior organization 
in mind when first seeing a display* These figures then, give some rough esti- 
mates of how to space a field of patterns to use proximity to engender an 
organization into rows and columns/ depending on whether the reader is expected 
to anticipate, a given organization in a graph or not* 

Proximity is one of the factors most commonly used to organize a chart or 
graph* For instance, except when a remote key or legend is employed, proximity 
is the usual means by which labels are associated with their referents* Figure 
3*16 shows an example of a graph extracted from one of the national news maga- 
zines in which the labels for the vertical axes are located at some distance 
from the axes to which they correspond* In this case/ confusion results/ al- 
though it is not. severe and can be resolved in a short time* This failure to 
use proximity to express association/ however innocent/ exacts a cost from the 
reader in his attempt to understand the graph* 

INSERT FIGURES 3*16 AND 3*17 HERE 

A more severe problem can ensue when proximity is misused so that an un- 
intended structure emerges* Consider, for example, Figure 3*17a* In this case 
a 5 by 5 array of circles is employed to show the fraction (column labels) of 
the average daily requirement of various nutrients (row labels) supplied by a 
serving of a given food substance* Because the distinct entities to be scaled 
are the various nutrients (one nutrient per row), and not the various propor- 
tions in their own right (which have no intrinsic interest)/ the semantics of 
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the chart requires an organization into rows* Ifte relative spacing between 
circles in the vertical and horizontal directions, however, clearly induces a 
perceptual organization in^o columns* Iftis renders the information in the 
graph obscure until the conflict between the semantics and syntax is resclved* 
Figure 3* 17b shows this chart redrawn with the proximity of circles favoring 
articulation by rows* Note that the meaning is much more evident when proxim- 
ity is properly employed* 



Similarity 

pfgures 'which resemble each other tend to be seen as grouped together* 
Figure 3*18a and b show an example of similarity acting as a grouping factor* 
Ihe 12/34/56/78 pattern emerges clearly in 3*18a as does the 123/456/789/10 
pattern in 3*t8b* Note that in 3*18b this structure emerges in spite of a 
stimulus spacing which favors a 12/34/56 structure* For estimates of the 
strength of grouping by various sorts of similarity (brightness, shape, hue, 
etc*) relative to proximity grouping, see Hochberg and Silverstein (1956) and 
Hochberg and Hardy (1960), some-of whose results are summarized in Figure 3*19* 

Similarity can be quite useful in expressing a relationship between graph- 
ic elements which, because of the format of the graph, must be situated at some 
distance from each other* For instance, Figure 3*20 shows a series of three 
graphs, each of which contains three specifier elements* In this case, al- 
though the 3 graphs discuss different topics (as indicated by the^r titles), 
each topic is discussed in common terms: a normal range, an actual level, and 
a minimum acceptable level* Although the specifier elements in each case are 
appropriately and adequately labeled, the similarity in form and texture of the 
elements serving the same function in each graph visually emphasizes the common 
semantic interpretation* in fact, once the specifier elements of one graph 
have been identified and interpreted, the meaning of the elements in the two 
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remaining graphs becomes immediately obvious* The alphanumeric labels of the 
specifier elements in these remaining graphs assume only secondary importance 
in the presence of the similarity grouping factor* 

As with proximity* similarity can be misapplied to su^ est an unintended 
structure* Consider the modification of the previous serit of graphs shown in 
Figure 3*21* In this series the alphanumeric label "ACTUAL LEVEL", is printed 
in large bold face type supposedly to emphasize the importance of this element 
relative to the other two* At first glance, however, this type face closely 
resembles that of the graph titles, "CRUDE-OIL STOCKS", etc* Only upon close 
scrutiny can the reader discern the slight differences in aspect and slant* 
The net result of this unintended similarity is that the reader may at first be 
led to believe that this label is part of the graph title and the reader pays a 
price, albeit small, in time and effort to correct this false implication* 
INSERT FIGURES 3*21, 3*22, AND 3*23 HERE 

Although Figure 3,19a shows similarity and proximity competing to promote 
different structures, these factors can be made to cooperate in emphasizing a 
single structure* Figure 3*22 shows such a situation* 

In general, it has been found in studies of tabular and textual materials 
that the use of similarity and proximity in providing redundant information 
about hierarchical organization benefits readability* For example, labels 
indicating subdivisions at the same hierarchical level should be similar in 
their left-to-right placement on the page, size of type, boldness of type, and 
case (Wright, in press)* The subdivisions themselves in a list are best when 
set off so that their left margins are aligned, so that the pieces in a sub*- 
diyision cohere because of proximity, common fate, and good Zorm (see belovr) 
(Hartley, 1978; Stewart, 1976; waller, 1977)* For a graph maker similar pat- 
terns of readability can be expected: subgraphs or groups of lines or bars 
that are related at a particular level in a conceptual hierarchy (e*g*, differ 
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ent year^/ different seasons within a year/ different months within a season) 
should b<* near each other, similar to each other, aligned with each other, and 
should bear alphanumeric labels with similar visual characteristics* Con- 
versely parts of graphs that belong to different groupings should differ along 
thw ,ime s* ^ 

Althou h the redundant application of grouping factors can be a powerful 
tool in graphic representation/ its misapplication can severely obscure the 
meaning of a graph or chart* Figure ^23 shows a'-chart similar to that of 
Figure 3*17 but with a particularly unfortunate coincidental distribution of 
nutrients* In this case the proximity and similarity factors cooperate. to 
render any but columnwise organization inaccessible* 

INSERT FIGURE 3*24 HERE 

Symme try 

A symmetric arrangement of marks is more likely to be interpreted as a 
figure than the sane marks in the absence of the symmetric relationship* Ihe 
operation of symmetry as an organizing force is demonstrated in Figure 3*24* 
The parts labelled a, b, and c in the left half of the illustration stand out 
as white figure on a dark background whereas the corresponding forms/ d, e, and 
f in the right half of the illustration appear as dark figures on a light back- 
ground* 

As for possible applications of symmetry grouping to charts and graphs, it 
is noteworthy that a symmetric arrangement is the conventional format for pre- 
senting a key, legend or table* The overall symmetry of these items first 
identifies then as simple structures, and secondly establishes the desired 
correspondence between opposing elements* Compare the keys on both panels of 
Figure 3*25/ the left is clearly superior* 

Figures 3* 26a and b demonstrate another use of symmetry? that of grouping 
together different graph elements to emphasize the convergence or divergence of 
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the pair/ rather than the individual trends of each element. In these figures 
the vertical arrangement of the two subgraphs in combination with the unconven- 
tional location of the horizontal axis in the upper subgraphs creates a symme- 
try which draws attention to the higher-order relationship betweer the speci- 
fier elements. Figures 3.27a *rtd b show alternative represantati of the 
information contained in the subgraphs of 3.26b. Note that the effect of thes.e 
presentations is much less striking. 

INSERT FIGURES 3.25, 3.26/ AND 3. 27 HERE 

Good Continuation 

When presented with a configuration of discrete marks or a set of curved 
lines that cross each other or double back on themselves, an observer will 
perceive the organization in which the elements are as smooth and continuous as 
possible. Figure 3.23 a, b, c, d, € and f show examples of configurations in 
which this phenomenon operates. Note/ for example that the discrete points in 
3.28a appear to be structured as distinct straight or smoothly curved continu- 
ous line segments. Perhaps the most likely organization to be perceived in 
3.23a is that shown by solid lines in 3.23b. Again in 3.23c one is most likely 
to perceive a 12/3 structure rather than the possible alternatives (13/2 or 
23/1) because this dominant structure avoids sharp changes in line direction. 
Similarly/ the organization 13/24 emerges in 3.28d for the same reason. Fig- 
ures 3.23e and f show cases in which the factor of good continuation is made to 
compete with closedness (to be discussed later). Good continuation dominates 
in both figures to create the structure 1 3 ? 7.../2 4 6 8... 

INSERT FIGURES 3.23 AND 3.29 HERE 

The graph maker can exploit good continuation to link a label with its 
associated specifier element. Figure 3.29 shows an extreme case, where group- 
ing by good continuation can actually overcome grouping by proximity to asso- 
ciate labels with their corresponding lines. Furthermore, good continuation 
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can tell the graph maker when in will be necessary to differentiate two lines 
on the basis of color, dots vs. dashes f etc* 

When the segments cf the lines can only be grouped in one way that con- 
forms to the ^xinciple of good continuation (e.g. Figure '3.30a), differentia- 
ting the lim. does not add app<. "iably to the readability of the graph. How- 
ever, when the lines have similar slopes where they intersect/ good continua- 
tion will not favor one organisation over another/ and ambiguity will result 
unless the appropriate line segments are linked to one another by; the principle 
of similarity (compare Figure 3.30b to Figure 3,30c). We have *£ound that a 
recurring cause of ambiguity in line graphs is the perceptual mis-segmentation 
of close/ intersecting lines. The law of grouping by good continuation 
explains this ambiguity and should alert the graph maker to this potential pit- 
fall. 

INSERT FIGURE 3.30 HERE 

Common Fate 

According to the Law of Common Fate/ elements in a moving display that are 
moving in the same direction and at the same velocity will be grouped together. 
In a stationary display, lines that follow the same trajectory across the page 
(i.e./ are parallel to one another) will be grouped together. Thus the curved 
lines A, B, C, and D in Figure 3.31 will be grouped together despite being dis- 
similar and far fro^i each other. Like good continuation/ common fate can help 
the graph reader to link labels to their associated lines, even if a label 
cannot be placed at the end of its line or closer to it than to other lines. 

INSERT FIGURES 3.31 AND 3.32 HERE 

In relational graphs displaying discrete data, such as is shown in Figure 
3.32, a graph maker "is sometimes tempted to emphasize a possible trend or rela- 
tionship by the addition of a continuous line through the swarm of points. The 
principle of grouping by common fate makes it likely that the line and the 
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subset or envelope of points that follow the same curve as the line will be 
grouped perceptually* Grouping by common fate might also allow the possibility 
of abuses in certain cases, given that the visual system may tend to group the 
dots into a structure with ,»e same trend as the line even when no such tren^ 
exists in the actual geometry of the points t'^, , elves. The honesty of sue* an 
addition th<>n depends on the particular situation. The graphmaker, in general, 
owes the graph reader some justification for superimposing a trend line onto a 
collection of dots. If the continuous line represents a locus of prediction, ■ 
based on some theory or a regression line resulting from statistical analysis, 
this should he clearly stated on the graph or in the associated text. 

INSERT FIGURE 3.33 HERE 
In other instances, a graph may display a sequence of discrete data such 
as a time series. Such graphs are often dra;,n with sequential points connected 
by short intervening straight lines, Figure 3.33 shows a time series with and 
without connecting lines. Although this connected dot display format has the 
advantage of making the sequence of dots easier -to follow, Rouse (1974) has 
shown that such a format mafcfc^^J^more difficult to estimate- visually the stan- 
dard deviation about the mean of the data. Subjects systematically underesti- 
mated this statistic for both the connected and unconnected dot formats, but 
their estimates using dots connected by straight lines, were significantly 
worse. Most likely, by grouping the dots into a continuous carve, common fate 
simultaneously made the dots harder to see as a swarm of dots per se (because 
Gestalt organizations compete with each other rather than coexisting perceptu- 
ally). And if it is hard to see the points as a swarm, it will be hard to see 
the properties of the swarm (e.g., its average width) that are necessary to 
estimate i^s standard deviation. 

- TNSERT FIGURE 3.34 HERE 
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Good Figure 

A region that is defined by a closed boundary is more likely to be seen as 
a figure than one which is incompletely closed or left open, especially if* the 
resulting shape is simple and regular. This can be seen in the illustrations 
of Figure 3*34* The progress, n of drawings from left to rit, in Figure 3*34a 
becomes more figure -like as the boundary approaches closure* The effect of 
closure on natural organization is made evident in Figure 3*34b where the domi- 
nant structure emerges as 12/34,56* A portion of this figure, left open by the 
absence of elements 5 and 6, reverts to a 23/*4 structure under the influence 
of good continuation* Finally, Figure 3* 34c shows a modification of a previous 
figure where the interruption of good continuation allows closure to dominate 
the figural organization* 

Good form is employed as an organizing force in charts and graphs in a 
variety of ways* A closed boundary may be used to separate a graph or series 
of graphs from surrounding material such as text* The outer framework of a 
graph may completely or partially enclpse the specifier and, thereby- define 
the coordinate system with respect to which the specifier element* of the graph 
are described (see Chapter 6)* The complete closure provided by the circular 
framework of a pie chart serves this function, as does the partial closure 
implied by the orthogonal axes of a Cartesian framework* It is not unusual for 
closure to be completed in a Cartesian system by a background field of homo- 
genous hue (Figure 3*35a) or by two thin lines completing the rectangle defined 
by the principal axes (Figure 3*35b)* Closedness is employed within a graph to 
define sub-structures such as keys and legends (Figure 3*36) and small speci- 
fier elements appear more substantial and noticeable when defined by closed 

contours* Also, closed symbols and characters ace more dominant than open 

t 

ones* Observe that the letter "o" appears more. dominant than the letter "c" in 
Figure 3*37* 

INSERT FIGURES 3*35, 3*3G AND 3*37 HERE 
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As with the preceding organizing factors, closedness can be and sometimes 
is used to disadvantage in graphs* Figure 3*38 shows a graph which was ex~ 
tracted from a leading national news magazine* the point of this graph is to 
show that wholesale prices (upper subgraph) are increasing while the valu** of 
the dollar relative to the German mark is dropping* The divergence of t. - 
jagged lines lines, which was to be emphasized by the vertical and symmetric 
arrangement of subgraphs, is, in actuality, masked by the closure implied by 
the stretched dollar sign* the graph is mentally encoded as a Jagged circle 
rather than as a pair of diverging lines, which is the encoding tha*: maximizes 
the chances of noticing the trends of interest* The net result of the mis- 
applied closure here is suboptimal graphic communication* 

Often a set of Gestalt principles work in tandem to affect the interpre- 
tation of graphs* For example, in a scatterplot, a set of points representing 
paired observations Df correlated variables are seen as an elongated, 
diagonally-oriented cloud whereas when the variables are uncorrelated a diffuse 
swarm is seen* Tne perception of the elongated cloud is presumably caased by 
the principles of proximity (the noints are closer to one another on the aver- 
age when variables ere correlated), good continuation (going from left-to- 
right, individual poj nts continue the trend of the previoas points), and good 
form (the resulting cloud is "relatively compact and has a smooth envelope)* 
The ability to reatf scatterplots accurately can thus be manipulated by manipu- 
lations that affect these Gestalt laws* Wainer £ Thissen (1979) show that 
people are generally accurate in judging correlations from scatterplots when 
the size of the cloud is not varied (as was done by Cleveland et* al* to 
illicit an illusion), including the ability to detect and compensate for 
"outlier" points* This is exactly what the Gestalt laws would predict, because 
^j^utliers would be perceived as' not being part of the cloud (they do not group 
according to proximity, good continuation/ and good form)* 
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INSERT FIGURE 3.38 HERE 
A corollary: "Goodness" of Parts 

Not only do the Gestalt laws define the perceived relation between one 
part and another they also define the perceived relations between parts and 
vholes*^Ins**inc -vely, some parts of: a pattern are "better" or easier to see 
than others (e.g./ the triangles in a star of David vs. the overlapping paral- 
lelograms), and the Gestalt laws give a more precise definition to this notion 
of "goodness" of a part (Wertheimer/ 1934). Specifically, a part will be 

"good" to th*s extent that 1) its own subparts are linker* together by the 

■ <* . 

Gestai: laws/ and 2) the subparts do not link- up with other subparts composing 
the rest of the figure. The overlapping triangles in a Star of David meet 
these criteria, but the parallelogram in the middle does not. That is/ the 
subparts of the triangles are grouped according to the Law of Good Form/ and 
these parts do not naturally group with other subparts. The parallelogram/ on 
the other hand, is composed of subparts that are more naturally grouped into 
other parts (the two triangles) by the u Law of Good Continuation. Thus/ 
triangles are good, easily- seen part*, whereas the parallelogram is a bad/ 
hard-to-see pare. As we snail see when we turn to processing limitations/ the 
"goodness" of parts will prove to be an important determinant of how easy it 
is to extract various types 6t information (e.g., simple values/ differences, 
trends) from various sorts of displays (e.g., tables/ bar graphs, line 
graphs) . 

2* Integral and Separable Dimensions of Visual Perception 

The coregoing principles all determine how separate marks will be grouped 
together. There is another kind of principle that not only sometimes deter- 
mines hew separate lines will be grouped, but determines which dimensions (such 
as hue, saturation, intensity, height, and width) will be grouped together. 
That is/ there are cases where we cannot help but attend to one thing given 
that we are attending to another. These sorts" of dimensions are called inte- 
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gral * According to Garner (1970), "Two dimensions are said to be integral if, 
in order for a level on one dimension to be realized/ a dimensional level must 
be specified for the other* 11 If the specification of one dimension in no way 
influences the specification of the other dimension/ then the two a*- spara- 
ble* When it corves to perceiving v lues on dimensions/ however/ some dimen- 
sions act as if they were integral (i*e*, one cannot notice one without the 
other) ever though geometrically it is possible to specify values on one inde- 
pendent of the other* Thus, it becomes a matter of psychology/ not geometry/ 
to determine which dimensions bind into single units* 

Integrality /separability becomes important to the study of graph percep- 
tion when we consider that graphs usually convey quantitative information by 
depicting elements that can be specified in terms of their values on a number 
of dimensions (e*g*, when a quantity is communicated via the length of a bar, 
the width of a circle sector, the darkness of a patch, etc*)* To extract the 
meaning of a graph, one must mentally "describe" its elements in terms of their 
meaning-bearing dimensions and then translate that visual description into 
appropriate conceptual "message" (see Chapter 6)* Thus it is crucial that the 
reader mentally encode the graphic elements in terms of dimensions that can, in 
fact/ be translated into the quantitative variables being communicated/ for 
example, the width of a sector in a pie chart and not its angular position; 
the heights of bars in a bar graph with a nominal scale and not the degree of 
curvature of the contour formed by their tops; the relative slopes of lines in 
a line graph and not their relative lightness* The study of the integrality 
and separability of perceptual dimensions tells us to what extent these percep- 
tual encodings are possible; that is/ whether one can ignore the position of a 
sector while attending to its width, and so on* If not, the reader will be 
forced to keep in mind various attributes of the graph that have no communica- 
tive function and just consume precious short-term memory capacity* 
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INSERT FIGURE 3*39 HERE 
Two experimental techniques are often used to determine if two dimensions 
are separable or integral* In one, subjects are asked to sort a deck of cards 
into two piles* On each card is a two-dimensional stimulus, and each stimulus 
can have ±ues on two dimensions (e*g*, a circle varying in size with a radius 
varying in angular positionj geometric forms that vary in both height and 
width)* Figure 3*39 illustrates a pertinent example, a simple stimulus with 
two bars, each of which can vary in height* The basic card sorting task 
requires the subject to sort the cards on the basis of the value of only one of 
the dimensions, such as the height of the left bar in the stimuli of Figure 
3*39* So, all cards with a short left bar would go in one pile, and all cards 
wj^th a tall left bar in the other* How, the values of the irrelevant dimension 
(the height of the right bar) are systematically varied; either they are 
constant (i*e*, they are always tall), or they are independent of the values of 
the other dimension (i*e*, they are either short or tall when the left ones are 
short) or they are redundant, reinforcing those values (i*e*, they are always 
tall when the left bar is tall and vice-versa) * if the dimensions are inte- 
gral, and here is the key prediction, then whether the two dimensions are con- 
stant, correlated, or independent will affect how easily the subject sorts on 
the basis of one of the dimensions* So, if the pair of bars are in fact pro- 
cessed integrally as a single unit, then the relation between them cannot be 
ignored: sorting the cards according to the height of the left bar will be 
easier when the height of the right bar is correlated with it, compared with . 
"hen the right bar is constant* The subject cannot help but pay attention to 
the height of the irrelevant bar, and its value being correlated with the vari- 
able of interest will improve sorting speed and accuracy* Similarly, with 
integral dimensions, the independence of the values of the two dimensions will 
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hinder speed and accuracy compared to when the irrelevant one is constant; the 
subject will be continuously "distracted" by the values on irrelevant dimen- 
sion, even though in this condition it provides no useful information* On the 
other hand, if the bars are separable, then the relation between them will be 
irrelevant, and it will mtk^ ro difference how the rig* r varies when people 
are asked to sort cards with a short left bar in one pile and a high left bar 
in the other* : 

The second technique used to discover which dimensions are integral and 
which are separable rely on "multidimensional scaling" of stimuli* In these 
studies subjects are first asked to assess the similarity of each member of a 
set of stimuli with each other member* These similarity data are then used as 
inputs to one of a number of standard multidimensional scaling programs (see 
Kruskal, 1964)* In the outputs of these analyses, each stimulus is represented 
as a point in a space and the distances among the points are proportional (or 
as close to proportional as possible) to the subjects' similarity judgements, 
with more similar items (usually) being closer together in the space* Now, 
there are different ways for the program to measure "distance between points", 
different distance metrics , when it creates the spatial model conforming *o the 
similarity ratings* "As the crow flies" and f 'as walking along city blocks" are 
two examples of measuring the distance between two buildings in a city* And, 
in fact, these two metrics, called Euclidean and City Block, respectively, have 
proven important for understanding how different dimensions are processed 
together in perception* when people assess the similarity of stimuli composed 
of integral dimensions, a Euclidean metric allows one to construct a better 
spatial model representing the similarities; in contrast, if stimuli are com- 
posed of separable dimensions, a city block metric does a better job* This 
relationship is a consequence of the mathematics of how the metrics are compu- 
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ted* Intuitively, if the dimensions are separable, then their effects will 
simply add* And in computing city block distances one merely adds the "legs of 
the triangle," specifying distances along the x and y coordinates of the space 
(let us assume it is two-dimensional for this example)* If they are integral, 
however, one "takes the hypotenuse," or jw's flight", which combines non- 
additively the individual contributions of the two dimensions* For a technical 
description of the underlying mathematics, the interested reader is referred to 
Attnt-ave (1950), Shepard (1964) and Torgerson (1958)* It is comforting that 
this very different experimental techniques leads to the same conclusions as 
Garner's card sorting task* 

A nunber of investigations have used both the basic card-sorting technique 
and multidimensional scaling with a wide variety of different dimensions* most 
of which could be incorporated in a chart or graph* Tkble 3*14 summarizes 
these findings, indicating which dimensions seem to be grouped into integral 
units; the table also provides an illustration of each kind of dimension* 

INSERT TABLE 3*14 HERE 

If the elements in a graph vary along a single dimension, dimensional 
integrality will play no significant role in comprehensibility* But we have a 
strong prediction when several dimensions vary at the same time; if the dimen- 
sions are integral, covarying them will make it easier for the reader to encode 
che information they convey, but if they vary independently t the information 
will be harder to interpret* For separable dimensions, the degree of correla- 
tion will not affect comprehensibility as much* tfor example, the height and 
width of rectangles are integral dimensions* If the height of rectangles con- 
vey, say, oil reserves of a country, then varying their widths simultaneously 
will increase the impression of whatever differences may exist, at is illustra- 
ted in Figures 3*40a and b* But if the widths vary on their own, as in Figure 
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3*40c, where width might signal/ say, population size^ then differences in 
the first variable will be harder to detect—because the width cannot be ig- 
nored when height is attended to, and intrudes into short-term memory* For 
separable dimensions (e*g*, rectangle height and the curvature of the lines 
filling them), this effect is not as apparent, as is i ■ crated in Figure 
3*40d* 

INSERT FIGURE 3*40 HERE 

III* Processing priorities and Limitations 

Not all stimuli are niven equal treatment; such are life's injustices* 
Some stimuli are inherently more "salient" than others, and as such grab one's 
attention at the outset* The factors that determine stimulus "salience," then, 
will also determine what it is that one is likely to notice and remember about 
a graphic display* These factors are especially important in light of the fact 
that one will not remember everything in a display* The virtual inevitability 
of imperfect recall is in part a consequence of limitations on our initial 
processing of a stimulus; in particular, by limitations on the "span of appre- 
hension" and the amount that can be held in short-term memory at once* Thus, 
in this section we will consider two kinds of principles, those pertaining to 
stimulus salience and those pertaining to limitations on short-term stimulus 
encoding* 

A* The principle of Stimulus Salience 

Under certain conditions, the perceptual properties inherent in a visual 
display will determine the likelihood and order that a given part is noticed 
and remembered* By varying the "salience" of the marks, one can facilitate the 
correct interpretation of the information content if the most salient marks 
draw the reader's attention to the most important part of the display* The 
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principle of stimulus salience, then* exhorts one to vary the marks such that 
the right information is likely to be encoded* what constitutes the "right 
information" is, of course, up to the chart or graph maker to decide* But what 
properties of marks dictate the priority of encoding? This- unfortunately, 
cannot be answered definitively in the general case* tfo s ^le dimension, such 
as color, necessarily takes precedence over other dimensions, such as shape* 
Forexample, consider two shapes chat are colored differently* If the shapes 
are a circle and a very circular elipse, the difference in colors will probably 
be more noticeable* But if the two shapes are a circle and a triangle and the 
colors are barely different shades of orange, then the shape will be the more 
salient dimension* Thus, the chart or graph maker must use his or her own 
intuitions about what the salient dimensions are in a given display—which is 
not particularly difficult, once one is alerted to the role that such 
perceptual "saliency" plays In emphasizing particular aspects of a display over 
others* in fact, in general one can expect large differences between values 
(e*g* size, brightness^color) of objects, and extreme values of a single 
dimension, to be perceptually salient; The human perceptual system Is often 
characterized as a "difference-detector" or "variation-detector" rather than as 
a detector of steady states or constant stimulation (Lindsay & Norman, 1972; 
Helson, 1964), so we can expe'ct that discrepancies and differences, especially 
extreme differences, among stimulus values will capture the reader's attention 
and find their vary into his or her encoding of the visual aspects of the 
graph . 

INSERT FIGURE 3*41 HERE 
For example, consider the graph in Figure 3*41a, which displays a pair of 
parallel lines* Under normal circumstances the lines will be interpreted, to 
indicate that A is greater than B, with A and B being in a simple relation to 
each other* Suppose, however, that we want to emphasize that B is less than A 
(e*g*, that the level of US military readiness is less than that of the USSR)* 



In this case, we want A to be the baseline and B to be defined in relation to 
it* If we assume that the line noticed first will serve as the "assertion" 
(and this should really be tested empirically), then simply by varying the 
weight of the line we can vary which serves as the baseline* Consider Figure 
3.41b; now the fact that B is less than A seems to jump out, - notic* 
first and then its relation to A. • 

INSERT FIGURE 3*4 2 HERE 
As an example of how the Principle of Stimulus Saliency can lead to im- 
paired interpretation* consider the graphic display of Figure 3.42a. This dis- 
play illustrates the monthly total flow in two drainage basins* The background 

is more varied and coDplex than the information-bearing components of the, dis- 

* 

play, and seems to draw one's attention from those components. Simply by mak- 
ing the lines delineating the background finer, and those comprising the dis- 
play proper bolder, we greatly improved the legibility of the graph — as is 
evident in Figure 3.42b. 
B. Principle of Finite Capacity 

This principle has two parts, one pertaining to the limitations of short- 
term memory and one pertaining to the limitations of re-organizational process- 
es used during encoding. 
1* Short-term Memory Capacity 

As a general principle, Less is Best: Human beings can only hold in mind 
a total of about four units at once# and hence should not be required to do 
more than this in order to comprehend a graphic display, ihere is a long his- 
tory of study of the limitations of short-tern memory, which has led to a vari- 
ety of conclusions. Everyone agrees that memory is severely limited/ but the 
question has been one of how much so. T?he problem in measuring memory limita- 
tions is that the correct unit is not necessary determined by any* simple 
measure of the number of stimulus elements. For example* if one gave someone a 
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list of digits to remember, the person could reorganise the list into pairs of 

digits (e*g*, H twenty-one rt rather than "two** and "one"). In this case, there 

would be half as many "psychological units" in the mind of this subject as in 

the initial set (as conceived of by an experimenter, who considered each digit 

as a unit)* And there is nothing from stopping the subject from organizing the 

digits into groups of three or more* Thus, it is critical to determine how 

elementary units are organized into each "chunk" (the technical name of a set 

* 

of information that is held in memory as a unit)* The classic paper on the 
subjept, by George Miller, lays out one hypothesis in the title* "The magical 
number seven, plus or minus two; some limits on our capacity for processing 
information*" However, we fear that this number is more magical than accurate* 
This number is suspect because subjects may have organized stimulus elements 
into fewer psychological units* In the course of studying the process of or- 
ganizing elements into units, Ericsson, Chase and Faloon (1980) provide support 

for a different estimate* 
* 

Ericsson et* al* asked one subject to return to the laboratory five times 
a week pver the course of nine months* At each session the subject was given a 
set of digits to recall* Amazingly, the subject gradually built up to the 
point where he could remember 79 digits! $he digits had to be presented rather 
slowly, however (one every 5 seconds) to allow the subject time to organize 
them into units* The nature of these units changed over time* The critical 
observation Ericsson et* al* made concerned sudden jumps in performance, which 
occurred when the subject discovered a new, more efficient way of grouping* 
Critically, at each jump, the number of digits retained was some multiple of 4. 
At first, the subject remembered pairs of digits, producing a span of roughly 
8* At the end, he was able to form groups of 20, and was able to describe the 
grouping strategies he developed, which perfectly predicted digit span of 4 
units were in fact retained* This estimate turns a sow's ear into a silk 
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purse: the grouping strategy which previously had obscurred the number of 
units being stored now was used to implicate that very factor*- And the answer 
was almost precisely 4* 

A number of studies of charts and graphs per se have documented the 
effect. * requiring the reader to keep too much informal in memory at once* 
Washburn (1927) and Vernon (l952)"found decreased accuracy in answering ques- 
tions about a graph as the amount of information that had to be remembered 
increased* Perhaps more interestingly, Schutz Cl 961 ) found that it was better 
to plot several lines within a single framework than to plot them on multiple 
frameworks it_ one was asked to compare values or trends* In contrast, if one 
was asked merely to retrieve single values, the way the functions were plotted 
made no differences* This is not surprising, given that only when comparisons 
are required need one remember where along a function one must make comparison* 
In this case, having the lines one above the other saves one the effort of 
remembering the location and the result of making the comparison for each fun- 
ction: now one need only move one's eye up the page, holding a minimal amount 
of information in mind at once* 

The use of keys and legends often will violate the principle of finite 
capacity* A key is equivalent to a "paired associates" task, where a person is 
asked tdyneraorize an association between two stimuli* In this case, one must 
memorize the pictorial information that indexes the different functions (dotted 
lines, different colored lines, etc*) and the label, and then mast match the 
line segment in the key with the proper specifier element in the graph* In 
contrast, if one labels the functions directly, there is no need to perform 
this memory task—which is a boon even if the amount of material in the key 
does not tax memory* Compare Figures 3*43a agd b to' see what we mean* Thus, 
it is not surprising that Culbertson and Powers (1959) found that labels and 
pictorial symbols that are directly associated with a function are easier to 
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lfead than k— /s* Not only does this system save memory effort, but the close 
proximity of the label and function serves to group them into a single unit 
(via the Gestalt Law of Proximity discussed earlier), eliminating extra 
processing rea n ired to look up which function is referred to by a label in a 
key* Similar ^sults have been obtained by Parkin (1981)* 

INSERT FIGURE 3*43 HERE 
One^ast study of capacity limits in graph comprehension must be noted* 
Price, Martuza, and Grouse (1974) investigated whether subjects encode point 
and/or slope information when they are instructed to learn the information 
presented in a line gra^n* The authors conclude from their results that sub- 
jects encode datapoint information and not slope information, and that in order 
to answer questions about slope, subjects recalled point information and then 
inferred the slope* This conclusion was supported by the finding that as the 
number of data points needed to respond correctly increased, performance de- 
creased* This last result is just as expected from the principle of finite 
capacity under the assumption that subjects do in fact encode point informa- 
tion* Why they would do' this instead of encoding slope per se, which involves 
fewer chunks or units and hence less demands on memory, cannot be stated with 
certainty* Both the specific instructions the subjects were given initially 
and the specific questions they were asked could possibly have biased the en- 
coding strategy, but we cannot know for certain because neither are described 
t in detail in the paper* If, in fact, subjects can be "set" to encod^graphic 
displays. in different wa^s, it ^ e verv important to study the textual 

context in which a graph is {placed* This clearly is an area begging for sys- 
tematic study* 

A straightforward consequence of the principle of finite capacity and the 
principle of stimulus saliency can be stated as follows: A ch^rt or graph 
should not convey more or less salient information than is necessary for the 

\ 



13c, 



purposes for which it was constructed* In the ideal case only the information 
necessary to extract the intended message should be incl £zd as salient marks 
in the display* ihis pertains to two considerations: how much information is 
included in a graph itself, and how much non-information bearing visual 
material (e.g., p- iTial backdrops) is drawn togethe zh the graph. As for 
the former, Tufte (1977) and Wainer (1977, 1978) caution against the temptation 
to make a graph serve as an archive for large amounts of data in circumstances 
where the communication of some part of the data is intended: in such cases 
the unnecessary material can overload the reader's capacity to the point where 
the intended message is inaccessible. TcTtake a simple example* 
Figure 3.44r, which is intended to illustrate the Ye rkes-Dodson Law (which 
states that performance will be more accurate with intermediate levels of 
arousal than in very high or very low arousal)* As a general illustration of* 
the Law, the idealization in Figure 3.44a is to be preferred over Figure 44b, 
which presents too much detail. However, in some cases complexity is unavoid- 
able: if one wants to know the additional details, for example, differences 
between men and women, then the idealization in Figure 3.44a is inappropriate 
and the raore complex Figure 3.44b is appropriate. 

INSERT FIGURE 3.44 HERE - 
The second practise that can make graphs too complex is*the inclusion of 
superflous salient material in a chart or graph. Tufte (1978) and Wainer & 
Thissen C 1 981 ) criticize the unthinking use of what Tufte calls "chart juhk", 
and urge that the "data/ink ratio* (the ratio of ink used to convey information 
and ink used for decorative purposes" be as close to 1 .0 as possible. As a 
simple example of the capacity demands of superfluous graphic decoration, com" 
pare the two graphs presented in Figure 3.45. Ifte leftmost one contains a set 
of background elements that vary in size and position, thereby becoming percep- 
tually salient and seeming, on first glance, to convey information. But in 
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fact these figures are used in an attempt to make the display more attractive 
or interesting, ^and convey no information in their own right* The middle graph 
simply removes these distracting elements, making the relevant information more 

-*sily seen* On the far rinht is a graph *~ s at retains the decorative elements 
^resent in the initial ones, but now reduce, the problem of interpretation by 
making the figures lighter than the information-bearing components (and thus 
taking advantage of the encoding priorities of the visual system and of group- 
ing by similarity), and keeping t?2 size and orientation of the background 
elements constant—and hence reducing their salience, their encoding likeli- 
hood, and the impression that they convey infprmation* 

INSERT FIGURES 3*45 AND 3*46 HERE 

As another example of cases in which presenting irrelevant information 
impairs interpreting a chart or graph, consider Figure 3.46a. Here again the 
background is patterned merely to make the chart or graph more interesting, but 
the reader cannot know this at first. By simply removing the background, legi- 
bility is improved immensely, as is evident in Figure 3.46b. We should add, 
however, that if a graph maVer insists on decorating a graph, the use of the 
other Principles discusseo in this book (e.g., as illustrated in Figure ,-45) 
will allow him or her to embellish a display without necessarily saddling the 
reader with' the task of sorting the kernels from the graphic chaff. 

A final example of how short-term memory limits and graph comprehension 
concern the duration of transient memories. Peterson £ Peterson (1959) showed 
that unrehearsed information in short-term memory decays within about 20 
seconds (see Klatsky, 1975, for a review of related findings, including the 
controversy over whether it is time per se or the processing of interfering 
material in the retention interval that causes information lossh A graph 
inker shbuld not force a reader to retain information necessary to interpret a 
graph for a long time* An obvious place bo keep this L in mind is the relative 



placement of a graph and the portion of the text referring to it. In fact, 

Whalley and Fleming (1975) have found that when a display is separated too far 

from the part of the text that discusses the information displayed in it, often 

the reader will not even lock at the display* 

2. Comparing Units or Parts of Units 

James Pomerantz, now at SONY Buffalo, used Garner's card-sorting technique 

to examine how marks (not values on dimensions) cohere into units* He. found 

that sorting tine was affected by how separate narks grouped into units* For 

example, people could sort the two stimuli "((" and"))" on the basis of the 

left parenthesis faster than they could sort ")(" „nd "()" in terms of the left 
in 

parenthesis. The interesting thing here is that the Gestalt Laws of Common 
Fate and/or Good Figure operate bo bound the two elements in each pair into a 
group, and it is difficult to consider a part independent of the entire unit, 
"Breaking up' 1 a perceptual unit and seeing one of its parts in isolation can be 
done, but it taxes our limited processing capacity. The effort involved in 
seeing a part in isolation is especially extreme when the part is a "bad part" 
as defined in the section on GestaJLt Laws* When a bad part" (such as the 
parallelogram in the center of a star of David) must be attended to, there must 
be greater allocation of the mental resources that would otherwise be used in 
maintaining information in short-term memory* And hence* there will be poorer 
performance in general* 

In the case of charts and graphs, it is critical that a small arbitrary 
segment of a continous line is a "bad" part by our earlier definition* The Law 
of Good Continuation tends to cause any single segment ot a line to be absorbed 
into the whole* In contrast, a single bar in an array of bars would be <* very 
"good" part because it is differentiated from the other bars by the Laws of 
Good Form, Proximity, and Continuity* By the. same token, however, it should be 
easier to attend to a line as a whole thin to groups of bars as a whole* The 
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line is seen as a single unit--and stored as a single "chunk'^-whereas the bars 
are seen as man} units/ which would be more difficult to process and store* 
Therefore, we are led to a prediction: tasks that require reading information 
about a specific value—and h«nce attending to a single point— should be easier 
for bar graphs than for line ,raphs/ whereas tasks that reguJ attending to 
the entire information set should be easier for line graphs than bar graphs. 
For example, reporting single values along a function (e.g., the amount of oil 
produced over time) should be easier with bar graphs/ but reporting trends (the 
rate of increase) should be easier with line graphs. Ifte literature on graph 
comprehension summarized below bears out this prediction. 

One of the classic studies of graph comprehension and use was reported by 
Washburn (1927). she presented junior high school children with an essay on 
the economic history of Florence/ and embedded in it a body of data that was 
displayed in different forms to different groups of students (the forms includ- 
ed a prose paragraph/ a unit pictograph/ a bar graph and a line graph), sub* 
jects studied the paragraph and then answered questions about the data, the 
questions pertained to the absolute amounts/ differences between amounts, and 
relative increases and decreases. The efficacy of the different formats proved 
to depend on the type of data to be extracted. If the viewer had to report on 
the value of one variable given the value of another (in a set of x,y ordered 
pairs), a table leads to faster and more accurate performance than a graph; 
this result was also found by Carter 0947a) and Narwrocki (1972). On the 
other hand, when subjects must report on differences between two values of one 
variable corresponding to two values on another variable, or when they must 
compare sets of differences (i.e./ trends), bar graphs and line graphs are 
(respectively) t-he more effective media. 

The fact that- line graphs are ineffective if a reader needs to know abso- 
lute quantities was demonstrated by Culbertson and Powers 0959). Subjects 
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were required to note and compare specific quantities on various forms of 
graphic displays* Both horizontal and vertical bar graphs were found superior 
to line graphs (and th.-re was no difference between the effectiveness of the 
two types of bar graphs)* When either line or bar graphs were segmented they 
were less effective, which, is interesting because the segmentation was ar 
trary (segments did not correspond to any meaningful variation along the x 
axis)* Thus, if subjects attended to individual segments as parts — which seems 
likely because these segments were perceptually "good" parts— they were not 
attending to the meaningful information-bearing part^* Lastly, when the speci- 
fier elements were presented over an inner framework consisting of grid lines/ 
the graph was more effective-- presumably because the grid lines helped to 
segment the parts of the line or bars that had to be attended to, and linked 
these parts via proximity and good continuation to the labels on the x axis* 

Schultz (1961) provided evidence consistent with our other prediction, 
that line graphs should be superior when information about trends had to be 
extracted* He showed subjects line graphs, vertical bar graphs and horizontal 
bar graphs* The subjects' task was to compare test graphs with a previously- 
j.earned set of patterns and rules for naming trends* Subjects were to study 
the test graph and choose the matching* pattern and rule* Line graphs were 
found to be superior* This study is flawed, however, in that the task may 
simply have been easier with line graphs because the test patterns had origin- 
ally been presented in the form of line graphs* (see MacDonald-Ross, 1977, for 
a. more complete survey of the literature on human graph reading)* 

Thus, we have reasonable support for our claim; it is easier to extract a 
single value from a bar graph or table than to read it off a line graph, pre- 
sumably because is that in the latter case one must break up a single percep- 
tual unit, the line, into "non-good" parts* But if the line does not need to 
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be broken into points, it now is more eff ective— *as one would expect given that 
more information is represented in fewer chunks (assuming a line is one chunk, 
as is each bar in a graph or entry in a table)* So, reading a trend is easy 
with a line gra£?h because the information is inherent in the single unit that 
can be looked up as such* Ta contrast, extracting gener*. crend information is 
harder with other media, where the trend must be computed, keeping a number of 
chunks in mind at once* Note that if precise differences are required, how- 
ever, how bar graphs are best — here one must extract precise value first, which 
is difficult with line graphs* In short, then, as long as one does not nae^to 
decoppose a perceptual unit into smaller parts that are not "good" parts 
according to the Gestalt laws, then it seeircs safe to say that the fewer units 
one uses in displaying information, the better* 
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CHAPTER 4: SEMANTIC, FORMAL, AND PRAGMATIC PRINCIPLES 



In the previous chapters we have reviewed syntactic principles, which are 



ciples are content-free in that they operate independently of what the lines 
mean* Even if none of the syntactic principles is violated, and hence one can 
detect the marks, read them without distortion, organise them correctly and 
hold the relevant information in mind at once, a chart or qraph may still be 
defective at a semantic level of analysis* The semantic analysis assigns 
meanings to the elements and the relations among them* If the wrong semantic 
interpretation is assigned to a given mark, the chart or graph obviously will 
not communicate effectively* 

Similarly, the mapping from mark to mark in a display may be faulty, or 
the wrong inferences may tend to be drawn* In such cases, a formal or prag- 
matic principle has been violated* In this chapter, then, we will consider 
semantic, formal, and pragmatic principles* 

Ultimately, all the changes one makes to improve a chart or graph will be 
made at the level of syntax* Once a violation of one of the semantic, formal, 
or pragmatic principles has been detected, it can be rectified by altering the 
Hnes themselves* But one should be careful to distinguish violations at the 
level of syntax proper, such as those discussed previously, with semantic, 
formal or pragmatic violations* These latter violations only come to light 
when the chart or graph is considered in its role of a communicator of specific 
information in a specific context, as discussed below* 



We have formulated two general semantic principles* The first is concern- 
ed with the compatibility between the mark used to convey information and the 
intended meaning* some symbols are better suited for a given role than others* 



concerned with how our visual systems interpret marks . . a p 
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These first principles, "The principles of surface compatibility/' have three 
distinct aspects, as discussed below* The second principles are concerned with 
the kind of knowledge one must have in order to understand a concept, Tftese 
"principles of schema availability" have three aspects* ranging from the 
availability of single concepts to individual differences in the availability 
of knowledge of specific graph types* 
1* The Principles of Surface Compatibility 

The basic message of these principles is straightforward* The format of a 
display should be compatible with its spontaneous interpretation* if a mark is 
spontaneously described in a way incompatible with what it represents, the 
graph maker is in trouble* This principle has three major aspects, which are 
implicated in the literature summarized below* 
a* Typicality 

In a series of very important and ingenious experiments, Eleanor fcosch 
reported findings that are relevant to how graphic displays should be labeled 
(see ttosch, 1978, for a more detailed nummary)* fcosch distinguishes between a 
"horizontal" and a "vertical" level of classification in a taxonomic hierarchy* 
For example, take the famili *r hierarchy of the animal kingdom, where each 
beast is a member of a species, a genus, a family, an order and so on* Within 
a given level, say species, some instances are more typical or "representative 0 
of the category than otheis* For example, a collie is a more typical dog than 
is a pekinese* This kind of variation defines the horizontal , wi thin-category 
dimensions* In addition, any given example can of course be assigned a classi- 
fication at numerous levels of hierarchy* The collie is not only a collie, but 
a canine c mammal and animal as well* This kind of variation defines the verti - 
cal dimension* Just as there is a best example of any given category* there is 
also a "best category" for any given example* when we see a dog we spontane- 
ously classify it to ourselves as a dog, not a mammal* The level at which we 
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spontaneously classify a typical object is called the basic level * The basic 
level is that at which the examples are as similar as possible while the cate- 
gory itself is as general as possible* So, to take another example, the basic 
level category f^t % Macintosh apple would be apple, fruit or Macintosh 
apple* The reason is that although "fruit" is a more general category, the 
examples within the category are rather dissimilar (watermelons and tomatoes 
don't have much in common)* "Macintosh apple," on the other hand, is a cate- 
gory with very similar members, but not much more similar than the more general 
category "apple*" Interestingly, the horizontal and vertical dimensions mutu- 
ally effect one another; atypical examples, such as a penguin, are not named 
at the basic level* Rather, they are spontaneously named at the most specific 
level ( Jolicoeur, Gluck, & Kosslyn, submitted) * 

The relevance of this work on categorization is clear when depictive sym- 
bols are used in a chart or graph, either as bar elements (e*g*, in a picto- 

T 

graph), background figures, or as labels* First, typical members of the cate- 
gory always should be used* "Birds" are best symbolized by robins, not pen- 
guins* Second, one should avoid a picture whose "basic level 1 * differs from the 
level that is the subject of the communicated message* Hakes should not be 
pictured if the picture is to stand for tools, since the basic level for a 
specific rake is "rake", not "tool", "object", or " leaf rake", so "ra)ce" is 
what the reader is likely to think when he or she sees the picture* To symbo- 
lize "tool", use some object whose basic level is appropriate for tools — such 
as a tool box, which will probably be encoded as such and not as a "box" or 
"electrician's tool box"* The rules for determining in advance how a picture 
will be. named are complicated and not yet totally worked out* The artist 
should merely show a depiction to a couple of people (wh^ are representative of 
the intended audience) and ask for its name; if the name spontaneously given is 
not correct, the drawing must be altered* 
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Consider some more concrete examples; Figures 4.1a and b show alternative 
presentations of the same information. But in Figure 4.1a the reader can be 
mislead into thinking that the graph is about cars--but in fact the gruph is 
about the rising price of gas, *' '-h is clearly evident in Figure 4.1b. In 
Figure 4.1a/ the basic lev^l of trie framework is "car," not "oil-burning 
vehicle", and this conflicts with its role as a constituent in a graph about 
gas. Figure 4.2 provides another example of tow this principle can be used to 
enhance graphic communication or, if violated, can impede it. The specifier in 
Figure 4.2a consists of pictures of different kinds of trees, the heights of 
which represent how much ' s **: sort of tree grows when soil is treated with 
sewage. The immediate meaning of the "bars" in this graph is the very kind of 
tree being represented, which serves to reinforce the message of the graph. 
Now consider Figure 4.2b, in which barrels (presumably of sewage) are used in- 
stead of trees. Now one is set to wondering about different amounts of sewage, 
which is not the point (or even indicated) in the graph. Even though the basic 
level of each tree is "tree", not "tamarack" or "pine", as one would wish in 
this case, "tree" is closer to ^tamarack" than "barrel" is in a person's mental^ 
dictionary, and henc;e the various trees would come to the reader'^ mind sooner 
in the first case than the second. 

INSERT FIGURES 4.1 AND 4.2 HERE 

b. Congruence 

This aspect of the principle of surface compatibility has four parts, all 
of which deal with setting up a "natural" correspondence between stimulus prop- 
erties and the information they convey. 

Cognitive compatibility . Perhaps the most basic form of cognitive com- 
patibility occurs when one makes sure that the physical characteristics of a 

mark, particularly its size and color, are appropriate for the information one 

* 

wants to convey. The description of the marks themselves should not contradict 
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their meaning. As an extreme case of mis-alignment, consider the Stroop pheno- 
menon; If I present you with the word "red" printed in blue ink, and ask you 
to name the color of the ink, you will experience interference* The meaning of 
the marks conflicts with the color itself* A Lv-qe mar)* will be described as 
large, and hence should not be used to represent- something small* Similarly, 
one should not use small font to spell out the word "elephant" and large font 
for the word "fly" or complex lettering for the word "simplicity 11 * 

INSERT FIGURE 4*3 HERE 

Figure 4*3a provides a somewhat subtle, but nevertheless troublesome, 
violation of the principle of surface compatibility* In this figure we see 
three groups of three bar elements* Each group refers to a particular output 
measure in a medical experiment and each bar within a group indicates the out- 
put level achieved by one of these types of treatments* According to the ver- 
tical axe's, the unit being measured is the percent deviation from a base value 
achieved by an untreated group of animals* However/ the physical base of the 
bars is clearly the horizontal axis of-the graph—which represents minus infin- 
ity* m order to interpret the information being represented the reader must 
pay attention to the empty space oetween the bars when they are less than the 
baseline, and a relatively small part of the bars when they extend above the 
baseline* This format clearly conflicts with the concepts of "a little better, 
a little worse" which are being represented (relative to the untreated con- 
trols)* Consider how much more obvious are the results when they are graphed 
as in Figure 4* 3b* In this figure/ "better" and "worse" correspond in a simple 
way to a simple relation relative to the baseline* 

Many of the violations of this principle involve color* Figure 4*4 
presents a common use of color in charts* different colors are used to stand 
for different proportions of households with pet fish* Most people have 
trouble in reading these graphs, however/ because different colors do not align 
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themselves into a single dimension to the eye* These kinds of dimensions — in 
which the values differ qualitative ly — can be contrasted with others* such as 
loudness, where the values differ quantitatively * Red is not "less blue", 
whereas 100db is less loud than 200db* Qualitative stimulus dirnen^ '. ns# such 
as color* should" not be used to represent quantitative conceptual dimensions* 
Figure 4*4b presents the same data using degree of shading as the differentia- 
tion—note how much easier "it is to compare the different regions in terms of 
relative numbers of pet fish (see also Wainer £ Francolini, 1980, who show that 
maps using transitions from one hue to another to illustrate a continuous 
variable are difficult to understand)* 

INSERT FIGURE 4.4 HERE 

An exception to this rule sometimes occurs for isolated parts of qualita- 
tive dimensions — Guinor and Stevens (1967), for example, found that green* blue 
and violet ordered quite naturally into a continua* as did red* orange and 
yellow— although the two sub-con tinua themselves do not naturally align in 
terns of a progression from "less" to "more* 1 * ■ 

Other psychological princi- diat bear on cognitive compatibility can be 
gleaned from the literature on synesthesia and cross -modality matching (Marks, 
1982)* Synesthesia occurs when perception in one sensory modality i s accom- 
panied by sensory experience in another* For example* many people report 
"seeing" colors appropriate to the music they are hearing* The pairing of- 
colors and tones is not arbitrary (e*g*# low tones go with blue, high with 
yellow), and the pairing found in synesthesia i s also found in cross-modal 
matching* Cross modal matching involves a person selecting values along one 
dimension, such as color* to be paired with values along another* such as 
pitch* People can pair cross-modal experiences very reliably* including some- 
what bi2arre combinations such as beer taste with pitch! In addition, people 
are near unanimous in judging that the vowel "a" (as in "bake") sounds or feels 
more yellowish than the vowel "o" (as in "not"), which feels more brownish or 
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blackish* One would therefore expect that visual dimensions that connote other 
sensory dimensions would make better symbols in graphs for those dimensions 
than 'other visual continua would* The only empirical test of this prediction/ 
Cuff's 0973) experiments on temperature maps, was taken to dj , firm the 
prediction; he found tha't blue was as effective as red ia conveying mean 
temperature of regions on a map/ and that a blue-red continuum was less effec- 
tive than blue alone or red alone (see also Wainer & Francolini/ 1980)* How- 
ever/ this may have nothing to do with the relative effectiveness of red and 
blue to symbolize temperature — it may simply, reflect the ease of: perceiving red 
and blue as lying along a*ccntinuum, as noted above (recall that colors vary 
qualitatively, not quantitatively)* Similar effects probably occur with what- 
ever this color scheme is used to symbolize* , 

In addition, although most synesthesia and cross-modality matching 
research examines the compatibility of one sensory dimension with another sen- 
sory dimension, it seems likely that certain abstract dimensions may "look 
better" when visually represented one way rather than another* For example/ 1 
our intuition is that the military strengths of nations are represented well by 
different sizes on a ma£ or different thicknesses of borders if defensive 
strength i s emphasized, whereas it would be less natural to represent average 
life expectances of the nations in those ways* 

Naming space* There is considerable evidence that there are some general 
principles of how we conceptualize visual space/ and these general principles 
can be of use to the graph maker* Our preference for conceptualizing visual 
scenes purportedly arise from fundamental constraints op how we conceptualize 
actual visual space in the real world (see Clark, 1973/ Clark/ Carpenter £ 
Just, 1973)* Specifically/ the important dimensions of physical space are 
those relative to the observer/ namely position relative to ground level and 
relative t to the scope of vision. Even though "up" and "down" are equivalent in 
terms of the information they^convey ("up" can signify "not down", and vice- 
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versa), and the same, is true for "front" and "back", psychologically there is 
a preference for coding the locations of objects in a visual scene as positive 
if they are up and in front, and as negative otherwise* rhis makes sense con- 
sidering that things that are down or behind can become invisible (underground 
or behind tf" A ad) in the real world* There is also a preference to make 
comparisons in terms of the "unmarked" member of a pair of polar adjectives, 
which is the member that serves as the name of the dimension* That Jls, to ask 
"how high (or tail) is X" does not imply that either is high — whereas asking 
"how low (or short) is X" implys that both are low* The use oj. a term which - 
d^s not label <a dimension proper, then, will lead the reader to infer that the 
variations along the dimension fall on one (usually the "low") end* 'fhe brief 
implication for gr-ph design concerns the labelling of x's and other graphic 
elements and how they will be interpreted by the reader* If a dimension is 
labelled as shortness", "smallness", "farness", etc*, the reader will draw 
conclusions that would not be drawn if the labels read "tallness", "nearness", 
and so on* Furthermore, comparisons of quantities using words like "shorter 
than", "smaller than", "farther than",- and so on, will he harder to understand 
that the equivalent comparisons using "taller than", "larger than", "nearer 
than", and so on (Clark & Clark, 1968/ Clark & Card, 1969)* 

Conceptual alignment * Once one has chosen a visual dimension to represent 
a conceptual variable, how does one decide which way to "align" the two scales? 
For example, if the oil production of nations -s to be represented by the 
light-dark dimension, should the nations with more oil be colored lighter or 
darker than the nations with less oil? Cooper and Ross (1975, see also Pinker 
and flirdsong, 1980) have proposed a rationale for deciding, based on the lin- 
guistic phenomenon known as "freezing 11 * Most languar/es permit conjoined words 
i idioms to be spoken only in one order, for example, "here and there", but 
not "there and here"; "kit an. 1 , caboodle, 1 ' but ,iot "caboodle and kit"* Cooper 
and Ros^ outline a set; 0 f quasi-universal phonological ^d semantic principles, 
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experimentally validated by Pinker & Birdsong U979> that govern these "frozen 

orderings"* Of concern here is their suggestion 'that the first terps pf frozen 

conjoined phrases are all pore "psychologically central 11 than the second terms, 

and vice versa* This provides a large set of simple predictions of the "best" 

way to fix the- entipoint of a raent i J variable with . endpoint of a visual one* 

Since we say "more or less" and we also say "light and dark 1 ', "thick and thin", 

and *up and down" (but not vice versa in either case), one can link up the 

first terns of each phrase^ and the second terms of each phrase* This leads one 

to assume that more oil should be represenced by lighter shades, thicker 

boundaries, or taller bars; and less oil by darker shades, thinner boundaries* 

and shorter bats according to Cooper and Ross* 

At first glance, this- appears contradictory to one's intuitions, which 

would lead one to make an area darker to represent more oil Cor generally more 
* 1 ■ 

of anything)* It also seems contrary to the results of Cuff (1973)* Recall 
that Caff investigated the dif f erenpes *T5£tween qualitative and quantitative 
methods of shading of colors to convey to readers the desired impression of 
distribution with the least amount of effort* Children in grades 6 through 12 
were shown maps that symbolized the temperature of a given region using 3 color 
schemes: shades of red, shades pf Mue/*and shades of red and blue. The 
children were told to consider these jtfaps _o be temperature maps* and to mark 
the areas they considered to be of highest/ medium, and'lowest temperature* No 
legends were used* to see whether an effective color scheme conveys the desired 
information to the reader in a natural way without referej^e to a legend* 
oorest results were obtained with the two color map* It appeared tt>at the 
qualitative associations of strong red with warm and strong blue with cool (see 
the forthcoming section on synesthesia and cross model matching) were not 
enough to override a tendency to associate Hght shades with low temperatures 
and vice versa* Deeper shades of blue (as well as deeper shades of red) 
su^^ssfully symbolized warmer temperatures, despite the inapprGjjriateness of 
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the blue as a symbol of warmth, and despite the freezes "light and dark" and 
"hot and cold", which would make "light" the natural symbol for "hot", one 
would think * 

How can we interpret these anomalous findings? Recall that graphs and 
maps are almost always shown on white backgrounds, se* .ig as the context 
against which the marks are defined* Given this* it is natural to interpret 
the darkest shades as those with the more ink on the page* or the most filled- 
in, or the most marked* Higher temperatures ("high and low" or "hot and cold' ) 
are represented by more ink ("more or less", "filled and unfilled", "shaded and 
blank")* support of this conjecture, we have often noticed that lecturers 
will refer to shaded regions on a blackboard as "dark 11 or "black" and empty 
regions as "light" or "white" — even though the opposite is literally the easel 
Evidently, the "f illed-unfilled 11 dimension is more salient than "light-dark", 
and this is reflected in people's preferences for dimensional pairing according - 
to the freezing principle* Thus, filled regions should be used to represent 
the first term in a frozen ordering, ajid unfilled regions the second term* 
c* Consistency with Cultural Convention 

A graphic display should not violate common cultural conventions* Much of 
the way symbols are used is determined by simple convention* But once such 
conventions are established, it becomes defeating to try to ignore them or, 
worse yet, fly in the face of them* An effective graphic display, then, will 
not viola-te^cultural conventions* One such convention is that we normally 
associate the change with a movement from left to right presumably because that 
is the way we read* Thus, we are used to interpreting a line that goes up as 
it progresses from left to right as indicating that some quantity is increasing 
(in fact, this is even reflected in the way we described the line — note that we 
did not say that the line went down as i.t progressed froro right to left) 1 *- T^f 

*Note that this particular case may not be an isolated convention in our cul- 
ture, and may not be a convention at all. The freezing principles predict 
}ust these pairings, since we say "higher and lower", "more and less", "up and 
down", and "right and left"* 
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the same display were rotated 90 degrees, the line would seem to go down, 
directly violating the interpretation that would follow from the usual conven-* 
tion* 

Other types of conventions for rep~ renting information in charts and 
graphs are rampant* For example, "red" indicates "stop," not H go" (and vice 
versa for green), the direction of movement Sfcqund a circle is clockwise, and 
so on* Not only are there general conventions in the culture, but each disci- 
pline and subculture has its own special conventions* The greek symbol Sigma 
represents summation to statisticians and engineers, and probably should not be 
used to represent something else to these readers* 
2* Schema Avai lability * 

Understanding a chart or graph involves translating a visual pattern into 
a set of conceptual or quantitative relationships* To do that, the reader must 
have some general ^rasp of the conceptual/quantitative relations that the gr^ph 
is trying to convey, and he or she must know the translation scheme by which 
the visual marks stand for quantitative information (which will differ from 
graph type to graph type)* we have called this translation scheme a graph 
schema (see Chapter 6), and whether or not a reader has a graph schema and the 
concepts it presupposes will affect his or her ease in reading the graph* A 
good chart or graph, then, should not incorporate concepts the intended reader- 
ship will have difficulty understanding and should not use formats that are 

* 

unfamiliar to the intended readership* in t^iis section we consider the data 
that bear on these principles, and attempt to discover whether there are any 
data abo'jt vhich kinds of concepts should not be used for given types of 
people * 

a* Concept Availapility 

Wo one can understand a graph if he or she dues not grasp the concepts 
themselvft that the graph is trying to communicate* one example of a violation 
pf this principle occurs when the graph maker overestimates the sophistication 
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of the readership* For example* consider a graph illustrating the percent 
change in prices for wheat for each month of the year* The height of each bar 
represents this change/ and hence the entire graph constitutes a plot of the 
ratf* of change over the period* If this graph had appeared xn an elementary 
scL jl text, most of the readers probably would not understand it* If the 
readership does not understand a concept/ the information can still be presen- 
ted* But now the graph maker must use the more elementary concepts upon which 
the more complex one was built* In this case/ "change over time 11 for different 
months seems clear enough* But if one were to plot 12 functions on a graph, 
with the Y axis representing price and the X axis time* the principle of pro- 
cessing limitations would be violated* So, if the point is simply to show that 
the rate is changing over the year, four plots—one per quarter — would do* The 
reader need only note the fanning pattern of the functions to get the message* 
Of course, if the readership is more sophisticated it may be better to use the 
more sophisticated concept/ and save the additional ink* 

There is either an embarrassment of riches or a depressing dearth of sys- 
tematic research on this aspect of the principle of schema availability/ 
depending on how you look at it* On one hand/ virtually all the work on chil- 
dren's concepts can be taken to bear on the principle/ but on the other hand 
there has been virtually no work systematically examining what percentage of 
different segments of the population are comfortable with concepts that con- 
ceivably could be used in a graphic display* Some of the work on children's 
concepts does in fact bear directly on the kinds of concepts necessary to 
understand charts and graphs* In particular, Piaget <see Flavell/ 1963) has 
provided us with much information about children's mathematical/ logical and 
conceptual competences* Among tuo relevant research is Piaget's investigation 
of how ard when child comes to realize that various objects differing along 
soma quantitative dimension can be interrelated by the concept of a scal<» or a 
series (see Piaget, 1967)* For example/ very young children cannot arrange a 
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set of sticks in order of increasing length* if this really reflects a lack of 
competence for reasoning about scales, then it seems likely that until that 
awareness develops (around 5-7 years for most domains, according to Piaget) the 
child will have trouble relating the curve on a gra- ' or an arr' , of bars to 
the concept it depicts* The conceptual abilities ot the child axe beyond the 
scope of this book; here we simply wish to point out the obvious relevance of 
this research to the problem of what children can be expected to extract from 
graphs at all, and refer the interested reader to Piaget's books on space, 
number and merging (Piaget, 1954, 1956; Piaget and Inhelder, 1971; see also 
Gelman and Gallistel, 1979, for further developments and some cogent critiques 
of some of Piaget's work in these areas)* 

Note that there will also be cases in which a reader possesses the con- 
cepts involved in a graph's message, but does not realize that the graph is 
communicating those concepts because the words used for labels are not in that 
reader's vocabulary* Symbols should be used that will be easily understood by 
the presumed readership* Most academics, and some people in the business com- 
munity, seem afflicted with a desire to use unusual words or symbols in place 
of more familiar ones, often resulting in a violation of the principle of 
schema availability* if a chart or graph is to be presented in a publication 
that is directed to a specific, well-defined readership, some jargon may be 
appropriate to prevent the use of lengthy locutions; however, as a general 
rule— and especially if the readership is not precisely def ifned— jargon in 
labels or special symbols should be avoided* 
b* Graph Schema availability 

A person must understand the conventions and notations used in a graphic 
display in order to comprehend it* If a person has never seen a given graph 
type, it p.esents a problem to be solved, at best (if the person is highly 
mo ,iv&t<id), or a road block, at worst* In fact, it has been shown that a very 
common response to the task ot analyzing an unfamiliar type of display is to 



153 

15o 



( 



9 



ignore the display altogether (Wright & Threlfall, 1980)* Thus it seems advis- 
able to stick to conventional formats for graphs unless the pattern of informa- 
tion to be communicated is so subtle/ unusual/ or complex that no such conven- 
tional graph can cc '->v the pattern simply (e*a the novel graphs for the 
presentation of statistical properties/ discusseu at length in Tukey (1977) and 
Wainer & Thissen (1981)* Furthermore/ it is important to know what sorts of 
graph types are most effective for what ^orts of people* But as we have seen 
in the previous sections/ even this goal must be qualified: the effectiveness 
of a chart or graph also depends on the uses, to which it will be put* Thus/ we 
must take care to evaluate graph effectiveness for different populations per- 
forming specific tasks* Researchers in the field have recognized this/ and 
many of the studies in the literature address this topic* Unfortunate Ty, the 
research generally is so flawed that not much can be inferred from the results* 
Improvements in methodology and theory have rendered most of the previous work^/ 
uninterpretable* In this section/ we will briefly review the most widely-cit^d 
studies in this genre, and will briefly critique them/ referring the reader to 
MacDonald-Ross (1977) for discussion of additional studies* If nothing else/ 
this exercise should be useful to readers who plan to do original research 
themselves or who plan to read the primary literature in detail* 
A* Developmental Research and its pitfalls 

A number of studies have examined how well children of different ages and 
grades comprehend charts and graphs* Mathews (1924) gave children a represen- 
tative sampling of graphic materials that pertained to one particular course of 
study, social science* Various forms of bar, line and circle graphs, time 
lines, and pictograms were included in the study* Ihe measure o£ difficulty 
was the percent of objective questions correctly answered by the -members of 
each $i;oup* The results are difficult to interpret, however (as the author 
himself acknowledged), because there is no way of telling which specific com- 
ponents of the materials were responsible for ease or difficulty of comprehen- 
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sion* The type of graphic components that the author varied included the orien- 
tation of bars (horizontal or perpendicular), number of shades on bars (i*e* if 
each bar represented more than one variable, if they were segmented), number of 
lines (on line graphs) or divisions (on circle graphs) and the presence or 
absence of scales* Thus, although Mathews found that the circle graphs werv 
easiest, with bar and line graphs being progressively more difficult, these 
results are suspicious* Critically, different questions were used with differ- 
ent displays* Thus, the observed differences may have nothing to do with dif- 
ferences in the graph types per se, but only with differences in the difficulty 
of the various questions* There was no attempt reinvestigate systematically 
which types of graphic displays are better suited for deriving which type of 
information* 

Another study that focused on the development of the comprehension of 
graphs is reported in Strickland's thesis (1938), in which children in grades 
1-4 were taught aspects of the history of technology using various sorts of 
graphs* The question of interest was: - which forms of graphs are suitable fcr 
each of these four grade levels? A graph was considered "suitable" if children 
at a given level answered questions at a level of accuracy exceeding chance 
performance by 30%* She concluded that first graders were "ready 1 * to learn, 
from unit pictographs and from developmental picture charts (a series of pic- 
tures depicting some characteristic of successful epochs, such as means of 
transportation)* She also found that second graders are "ready" to learn from 
circle graphs as well, and so on up the academic ladder* Bowever, given the 
arbitrariness of her criterion, it is hard to take such conclusions seriously, 
especially since all grade levels responded above chance accuracy for all types 
of graphs* 

Of more potential interest is Strickland s ranking or different graphs in 
terms of accuracy of understanding by different grade levels* She found that 
the ranking of graph types did differ from one gr4d« level to another, which at 
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first glance seems to suggest that there are Interesting differences in the 
development of different cognitive components (as opposed to a simple monotonic 
increase in attention or other skills, which would have left the ranking intact 
over grade levels)* Unfortunately/ different grades were p*- -anted with dif- 
ferent examples of each type, of graph, roughly adjusted to tue children's level 
of skill, thus making direct comparisons of graph types across grades impos- 
sible*, In fact/ the number of correct answers overall did not increase with 
increasing age—confirming that the corresponding graphs for different grades 
differed in intrinsic difficulty* Nonetheless/ there were some consistent 
differences among graph types for all ages: line graphs were consistently 
harder/ wheceas "developmental picture charts" (series of pictures exemplifying 
a historical trend) were consistently easier* However, the different formats 
shown to children depicted different sets of data, preventing us from knowing 
whether it was the line graphs per se or the particular information set that 
Strickland chose to depict by line graphs that led to poor performance* 
Finally/ Strickland ranked the kinds of information depicted in graphs in terms 
of how early children could consistently answer questions about them* She 
found that as children aged, they became better at answering questions that 
compared several units of information as opposed to single units/ at reporting 
absolute quantities and precise rates as opposed to relative amounts (i*e*/ 
"greater than" and "less than")/ at deducing the purpose of the graph/ and at 
reasoning about the information in the graph* Again/ the value of these obser- 
vations is dubious — they could indicate developmental change in attention* 
interest* memory/ reasoning power/ perceptual acuity/ or a combination of these 
factors - 

■Other developmental studies of graph comprehension yielded scattered find- 
ings, Vernon (1950V reported only that performance was poor overall; Ve2in 
(1974) found improvements in performance with age, and overall advantdqo*; for 
concrete over abstract material; Halter (1948) found that younger children are 
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poor in recognizing conventional symbols (such as arrows symbolizing movement) 
in diagrams; washburn (1927) reported no effects of age among junior high 
school students; and Zwaga and Boersma (1973) reported a slight advantage of 
young adults over older adults in r mizing stylized symbols for railroad 
facilities* Both Vernon (1950) and Strickland (1938) failed to find correla- 
tions between children*s performance for a type of graph and theii accuracy in 
reporting information from it* Unfortunately, as this heterogeneous collection 
of studies attests^ nothing even resembling a systematic approach examining 
developmental changes for each cognitive component involved in the comprehen- 
sion process has been attempted* 

B* Individual Differences Research and its Pitfalls 

Vernon (1946, 1952) has investigated, in a qualitative way, the retention 
of material from graphs, testing the hypothesis that graphs will succeed where 
other methods fail in educating the "man in the street"* She presented sub- 
jects of varying levels of education with sets of demographic data graphed in 
various ways and asked them to answer questions or write paragraphs about the 
graphs* she was impressed by the generally sketchy and inaccurate recall of 
*ata, and by the subjects' failure to draw logical, coherent conclusions from 
the data depicted* she also found that recall increased with the educational 
level of subjects* 

Aside from dispelling the naive notion that graphs are a panacea for ig- 
norance/ Vernon's studies are of little interest, since again/ they measure the 
effects of a large number of cognitive processes acting in concert* For one 
thing, it appears that many of the subjects simply did not understand the Ques- 
tions as the experimenter intended them to be understood* For example/ they 
frequently described the superficial visual appearance of the graph instead of 
the data contained in it, or they answered questions on public policy accorJinij 
to their own opinions instead of according to tnw narrow implications strictly 
suggested by the data in the graph* In addition, subjects had to answer quss- 




tions from memory instead of responding in the presence of the graphs* Thus* 
they could have perceived the data from the graph perfectly well/ but could 
have forgotten it rather quickly — a plausible interpretation, given the proba- 
ble lack of interest tf \ the data on the part of the subjects* in fact/ the 
memory requirement ca,. distort not only the absolute amount of information 
gathered from a graph, but also the differential effectiveness of different 
types of graphs/ or of graphs as compared to other media* 'ftiese possible dis- 
tortions could be due to the fact" that different sorts of displays may be en- 
coded in different formats in memory (such as images or words, which may decay 
at different rates)* Also/ because of the responses required of the subjects/ 
graph comprehension was confounded with general verbal ability* Finally, the 
effects Vernon observed of education and intelligence on retention are also of 
negligible value; the differences that were observed consisted simply of lower 
retention by the less educated/intelligent subjects* This could betray differ- 
ences in interest, attention/ memory/ comprehension of instructions, comprehen- 
sion of graphs, knowledge, ability to reason about the information conveyed by 
the graphs (e*g*, how to derive the rate of increase in a population knowing 
the birth and death rates), or some combination of these factors* Further, 
socio- economic status is confounded with intelligence and education level in 
this study* Thus, the Vernon experiments are a good demonstration of the 
pitfalls that can be encountered in testing comprehension of graphs when the 
different cognitive components are not considered separately* Later in this 
book we will provide an analysis of these components and make suggestions about 
how this analysis could guide further research* 

There is also a dearth of research on other indiv* ual difference varia- 
bles* Strickland (1933) found no sex differences among subjects, though Vernon 
(1950) found boys were more accurate than girl;., i n accord with the large lit- 
erature suggesting that boys are better than girls at spatial and quantitative 



reasoning (see Maccoby* 1966)* Given the problems with this research, however, 
even this result must be taken with a grain of salt* 

Like the developmental studies, the individual difference studies would be 
nearly useless even if they had yielded reliable resets* "Main effects" of an 
individual difference variable (i*e*, across the boaw advantages or disadvan- 
tages) are basically uninterpre table/ since they could reflect differences in 
a host of variables, such as knowledge/ memcry, attention, or interest* Once 
more it must be stressed that the target of such research must be a characteri- 
zation >f differences in operating characteristics of different cognitive com- 
ponents in different populations/ for example/ boys might be less prone than 
girls to organizing figures according to the Gestalt Law of Common Fate* 

II* Formal Principles 

There are many places for potential slips 'twixt cup and lip 1 in reading a 
graphic display/ and a major one lies in the link between the actual marks and 
the literal meaning drawn from them* The reader will be attempting to trans- 
late each visual element on the page into soma conceptual entity or relation- 
ship/ and this will be difficult or impossible if there is not proper mappings 
between marks and concepts in the graph itself* We have fornwlated two formal 
principles that seem to capture the critical ingredients of a correat mapping 
and which/ if violated/ result in an ambiguous or misinterpreted display* 
A» The external mapping principle 

Every mark should map into one and only one semantic category, and every 
piece of information necessary to read the intended information should be indi- 
cated unambiguousl/* The first part of this principle corresponds to Goodman's 
criterion o* disjointedness, and the second to his criterion of dif ferenti 3- 
tion (see Chapter 2)* This principle will be violated if a mark is ambiguous 
(such as a specifier bar that could be interpreted as containing two abutting 



segments or one longer part with a smaller one laying over part of it) or a 
necessary set of marks is mining (such as numbers demarkating a scale)* The 
important distinction between this source of ambiguity and ambiguity that can 
arise due to ope- 'on of the syntactic principles (e c <*cially those pertaining 
to discriminability and grouping) is that thSre is no jeometric or pictorial 
transformation of the existing graph that could correct this sort of formal 
ambiguity* For example, if ambiguity arises because a label is equidistant 
from two axis, and hence is grouped equally well with each one, this could be 
corrected simply by moving the mark* But^if the mark is inherently ambiguous 
or missing, no amount of rearranging the existing display will correct 



The importance of violations of this principle is context-bound to an 
unusual degree, as virtually any continuous function or axis in theory violates 
it (see Chapter 2)* In these cases, it is impossible to identify any given 
location with absolute precision— no matter how one alters the graph (blows it 
u^etc*)* But in virtually all cases; absolute precision is not necessary for 
the reader to get the intended message* in fact, excess precision will get in 
the way* For example, marking off 1/l00th of a gallon of oil production on a 
scale is superfluous if the reader is supposed to see how production changes 
per year — and will tax reader's limited capacity to prbcess information* 

Similarly, whenever a picture is used as a label, this principle is tech- 
nically violated* Any picture can be assigned an infinite number of interpre- 
tations, in theory* A picture of a sitting man, for example, could be a pic- 
ture of a man's J^ead, bent knees, John, a sitting Caucasian, etc* But if the 
graph maker obeys the elementary principles of symbclization we outlined when 
discussing the principle of surface compatibility, this should not in fact be a 
problem; th- correct picture will be given only a single label by virtually 
all readers* 



matters* 



160 




T 



Consider tue graph shown in Figure 4*5, which shows weight gain for labor- 
atory animals over time for two different kinds ,of food* The variables repre- 
sented in this graph are weight/ time and experimental conditions. Note that 
the value of the treatment variable is represented by the level of the dashed 
line* After week number six, however, the value of the treatment variable 
changes from "Test* 1 to "Control 11 * This difference in the value of the variable 

is not represented by a difference in the specifier mark (the line representing 

* 

the function)* Thus, the vertical mapping principle is violated; a meaningful 
difference in what is represented is not indicated by a difference in the 
mark* 

INSERT FIGURES 4*5 AHD 4*6 HERE 
This principle is especially important when a reader is supposed to be 
able to -assign values to discrete categories of some kind* If the categories 
themselves are not represented by discrete marks, this will be difficult, if 
not impossible* Figure 4*6 illustrates two ways of presenting the same infor- 
mation, the graph on the left violating the present principle* If the reader 

t 

is supposed to be able to discern which color will be associated with which 
temperature, the chart on the left is clearly inadequate* 

INSERT FIGURE 4*7 HERE 
Consider Figure 4*7* This kind of display is common when one wants to 
display additive components of a set of numbers* Figures 4*7a and b shott two 
ways in which the physical mark can be interpreted* One interpretation is 
that the mark is a compo&ite of two contiguous marks, labeled in the graph "x f ' 
and "y"* Another interpretation is that the mark is a composite of two over- 
lapping marks, with "x" included in ,f y M . Note, then, that depending on how the 
physical. mark is interpreted different sorts of information will be inferred* 
By using the format of 4*7a, it is cle?c: that stopping distances is being 
assessed only after braking has begun, whereas by using the format of 4*7b, it 
is clear that the total stopping distance includes the time to begin braking* 
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Thus, a formal ambiguity requires not just re-arranging or re-scaling parts of 
a display to correct, but mare fundamental changes in how information is 
presented* 

In principle, the most severe violations of the external ^ing principle 
occur when a basic graphic constituent is missing* Figure 4*8 presents some 
common examples one often sees on blackboards* Without the framework, a perscn 
not privy to the coversational context of the gra^h cannot know the baseline or 
variation along the relevant scales* But recall that one of the basic ideas of 
our approach to graphic design is the notion. of purpose-specificity: depending 
on the purpose of a chart or graph, certain information will be required and 
other information will be superfluous* So even here, if only a trend were 
required, and a verbal context provided the relevant background, even the 
quasi-graphs of Figure 4*8 could be adequate* 

INSERT FIGURE 4*8 HERE 

Having ^id this much, it is necessary to point out how the external map- 
ping principle is related to the principle of surface compatibility* Some 
aspects of the principle of, surface compatibility, the reader will recall* 
hinge on marks being interpreted both as a depiction and as a symbol* For 
example, marks could serve to delineate a framework of a graph on rising gold 
prices while at the same time depicting a bar of gol£* Ox, a mar k caj ^ serve as 
a specifier (a bar in a bar graph) while also depicting a tree* The vertical 
mapping principle applies separately to the role of a mark as a symbol and the 
role of a mark as a depiction* In both cases the interpretation should be 
unambiguous, and tfre clear interpretation of the meaning of the symbol does not 
Guarantee the clear 'i nterpretation of the depiction and vice versa (e*g.* the 
marks may serve well as a framewoVk but be confusing a5 a depiction or vice 
versa)* 

0* The internal mapping Principle 

The correspondence between portions of a display should be unambiguous. 
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The foregoing principle was concerned with the direct interpretation of the 
meanings of marks in isolation* This principle is concerned with the interpre- 
tation of the relations among marks — between specifier elements and labe3^ 
between axes o** -^mework constitu and specifier elements/ between sub- 
graphs and the nuiin graph/ and so on* xt .is po^ible to have perfectly inter- 
pretable marks for which the interrelations are not clear* Given that ail 
charts a nd graphs communicate information by displaying some kind of mapping 
between entities — either between different quantities or different qualities — 
the necessity for easily-read associations is obvious* And yet, it is very 
common to discover cases in which the relations among different parts of the 
display are not clear* For example, consider the graph of figure 4*9a, showing 
levels of Dow Jones Industrials from 1927 through 1937* The insert represents 
a magnification of a portion of the display, which is indicated by the bracket* 
Note that the portion indicated by the bracket does not exceed the 350* level, 
Dut the corresponding insert represents a portion extending beyond the 350 
level* The puzzlement caused here is clearly eliminated in figure 4*9b* 

INSERT FIGURES 4*9 AND 4*10 HERE 

Consider now figure 4* 10a, which presents a graph used in a textbook on 
physiology* Here we see six groups of bar elements representing six different 
ohysiological and pathological states* Each group is composed of three ele- 
ments and eacn of these elements represents a particular property or component 
of blood* Note, however, that two vertical axes are present at the left of the 
graph* Each axis is associated with a numerical scale and a label indicating 
the relevant units* But tfie association between #ach axis and the relevant 
bars is nissing; we don't know to read the meaning of the individual bars* 
Consider how much easier it is to interpret the chart when the principle is 
followed and it IS correctly laLaled, as is illustrated in Fi<jure 4*10b* 

Th* foregoing examples were intended to provide clear illustrations of 
violations of the internal mapping principle* However, for other typtss of 
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graphs these violations may be less obvious at first glance* Consider the mul- 
tiple framework display shown in Figure 4*11, which illustrates receipts for 
particular services from the years 1976 to 1979* Were the receipts for Busi- 
ness Services in 1979 greater than ' ^ receipts for the Hotel/Motel group? If 
you answer by looking at the specifier, the answer would be "no"* Howv. ver, the 
two vertical axes do not use the same scale values* Thus, in fact ti^e answer 
should have been "yes"* In this case there is a faulty correspondence between 
two elements of the frameworks of sub-graphs* Not^, however, that this problem 
is very much bound to the potential use o£ the graph* If the display wer<* 
intended only to allow one to compare relative trends over tin: , then there is 
no imped inent (although, technically speaking, a violation exists)* Figure 
4*12a and 4*12b present alternative ways of illustrating this same data which 
clo not f all *v to the violation of this principle* Note, however, that in 
normalizing the scales other things are lost (such as an ability to read easily 
the variations among the less profitable businesses)* , 

INSERT FIGURES 4*11 AND 4*12 HERE 
So, the message again is clear* Once one is aware of the sources of po- 
tential problems with respect to a given purpose, it is usually easy to see how 
to circumvent them--thoagh the graph still cannot be ail things to all readers* 

III* Pragmatic Principles 
j 

The principles reviewed thus far have been concerned with how charts and 
graphs convey information as complex symbols* As such, we have considered how 
the marks on a pa^je are analysed and grouped by the perceptual system and how 
th* lit^rAl meaning of the^t* marks is assigned* But conprehendintj a chart or 
gtMph involves more than merely assigning a literal interpretation to symbols, 
jUTt as understanding language involves mor*? than interpreting each word and 
the relations among them Literally* As is the case with linguistic utterances, 



164 



ft 



graphic displays occur in a communicative context* ^fie reader is expected to 
draw inferences and to be sensitive to connotations that are not explicitly 
present* These indirect statements can either be accurate or misleading, and 
tne graph mahor may sometimes intentionally make a "political statement" with a 
misleading connotation* We will not comment further o ^ principle ^t 
deception is unethical, but will only point out the dynamics of how charts and 
graphs coiae to convey information indirectly* Giv^n an understanding of these 
dynamics the chart or 9raph maker can be aware of the potentials for inadvert- 
ent deception--and the chart or graph reader can be alerted to detecting cases 
in which he or she may be systematically misled* 

We shall consider two general classes of pragmatic principles* The first 
pertain to the inferences we are invited to make when viewing a display* and 
the second pertain to the effects of context on how we see and comprehend a 
display* 

1 Invited inference 
If one is asked, "Can you open the door?," one does not say "yes" and 
leave it at that (pesky thirteen-year olds excepted); rather, one opens the 
door But strictly speaking, literal interpretation of the utterance is that a 
question is being asked* What is happening here is an example of pragmatic 
factors at work in the comprehension of language* One draws an inference above 
and beyond the literal meaning* We can outline a number of ways in which 
charts and graphs can be constructed to induce readers to draw particular 
inferences* Many of these devices are discussed at length in Huff's excellent 
book, Kow to Lie with statistics * The basic idea of all of these manipulations 
is the same: use physical properties of the display in such a way that the 
description of the display itself will exaggerate or downplay specific informa- 
tion* 

INSERT FIGURE 4*13 HERE 
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Labels 

The words and phrases n^ed as labels can dramatize a point* In tho first 
p^nel of Figure 4*13, the title makes an unemphatic statement about the content 
of th^ graph* In the second, the word "incret is replaced by the word 
"soars", bringing to mind a set of connotations not implied in the first place. 
Now Lt ls taking off* In the third version the word "inflation" is replaced by 
the words "runaway inflation"* Now it is not only taking off, but dragging 
everything with it as it careens away! Thus, although the same information is 
presented in the display, the way it is labeled affects the way it is inter- 
preted* Though these differences may seem trivial, their effects are not* 
Elizabeth Loftus 0979) has found that when people witness an event, subtle 
changes Ln the wording of questions asked afterward have considerable effects 
in people's recollection of the events* Thus, when asked a question like "How 
fast was the Ford going when it smashed into the VW? 1 ', people give higher esti- 
mates of the Ford's speed than people asked the more neutral "How fast was the 
Ford going when it collided with the VW?"* Furthermore, the first group/ but 
not the second, mistakenly "remembered" having seen broken glass in the 
original *avent* Little words matter* 

Another way labels can be used to give the "wrong" impression is through 
their absence* As uuff (1954) points out, if one has a small effect and has 
expanded the vertical scale to amplify it one can conceal this fact through the 
simple device of failing to label the units and the scale* (Of course, this 
violates the external mapping principle, but even if the graph were unambig- 
uous, dte reader might draw the wrong inferences if the axis labelling was not 
salient, or if the reader did not have mastery of the conceptual distinction 
between absolute differing* and proportional difference * 
franowork Variations 

Scale Units * The use of different scale units is another way in which a 
reader can be led to draw inferences, and w as one of the chief ways to lie with 
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statistics that Huff documented* Because our cognitive systems encode not only 
the magnitudes of physical continua* but also sort them into a small number of 

^ discrete categories in memory (Miller, 1956; Kosslyn, Murphy, Bemesderfer & 

Feinstein, 1978), once a visual mark is expanded to a certain size* it will b 
"bumped" into a new perceptual category/ and may be represented internally as 
"tall" rather than "medium" or "4 inches high". And, if Pinker's conjectures 
in Chapter 6 are correct/ people familiar with a given type of graph will tran- 
slate perceptual categories such as "very dark" directly into conceptual cate- 
gories like "very large"* Thus, a gradual change in a physical continuum, such 
as would be accomplished by stretching an axis, may be encoded in short-term 
memory as if it were a quantum change/ which would then be translated into a 
quantum change in the quantitative message that the reader carries away* Con- 
sider the difference in the apparent increases in figures 4*14a and 4*14b, 
which vary only in the amount of compression along the vertical axis* in gen- 

^ eral, when one selects a large scale unit/ one is implying that the amount of 

increase is smalli conversely, one amplifies an, effect by selecting a larger 
vertical axis, spreading out the scale units* Ifte use of logarithms instead of 
linear units can have similar effects* 

INSERT FIGURES 4*14 AND 4*15 HERE 
Truncation * The way a difference appears can also be manipulated in a 
graph by truncating the vertical axis and expanding the portion of the scale 

e 

that remains* The two graphs in Figure 4*15 represent the same information, 
about numbers of US and USSR missiles* But the one on the left was drawn by a 
SALT II proponent and the one on the right w s drawn by a SALT II opponent* By 
deciding to begin the scale at 100,000 (a number we just made up, by the way), 
we could spread out the remainder of the seals — amplifying the apparent differ*- 
ence* 

As poet . Ono of the most powerful ways of slanting a giv^n graph (if you 
will forgive the pun), is by altering the aspect of the axes* or ratio of their 
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scales* Figures 4*t6a and b show alternative presentations of the same set of 
data* In Figure 4*1^ the ratio of the vertical axis length to the horizontal 
axis length is 2:1, whereas in Figure 4*16b the ratio is 1:2* itote that the 
increasing trend »** the data is much more strik' ■ in the first graph* 

INSERT FIGURES 4*16 AND 4*17 HERE 
3-D * If a framework is made to project at an angle in space (e*g*, it is 
"painted" on a wall that one examines from an extreme angle), the foreshorten- 
ing that results can emphasize or de-emphasize a trend* This is because we do 
not perceive line drawings of extreme perspective projections accurately 
(Kubovy, in press; Hagen, 1981)* For example, in Figure 4*17, widely dissemin- 
ated by the Reagan administration in 1982, consumer prices are seen to take a 
noticeable drop* However, the decline appears far less impressive when we 
consider that the drop is only 0*3% of he CPI, an amount whose tininess is 
obscured by the fact that it is expanded in perspective in the extreme perspec- 
tive view of the graph depicted* 

INSERT FIGtJRE 4*17 HERE 

Specifier 
Depictions * 

Numerous inferences can be invited by different depictions serving as the 
specifier in a chart or graph* In a graph that presents the number of annual 
traffic fatalities over a decade, ordinary bars would suffice to present the 
data, as is shown in Figure 4*l8a* But the implications of those data are 
really brought home when the bars are replaced with stacks of human skulls, a s 
is evident in Figure 4*1 8b* 

INSERT FIGURE 4*18 HERE 
Correlated Variations of Integral Dimensions * 

I- is possible to create an impression that a trend is increasing more 
than it in fact is, by talcing advantage of the fact that vertical extent, 
horizontal extent, and extent in depth are integral dimensions* Thus, the 
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value on one dimension cannot help but affect the value on the other* In using 
a bar graph, then, expandin he width of the bars as their height increases 
will leave the impression that the increase is more pronounced than it actually 
is* Similarly, if pictograms are used (in which a picture serves the role of a 
bar), the size of the entire pictui can be varied as well as Just the extent 
along the relevant dimension* In addition, using heavier lines as bars 
increase in size will underline the increase, as will shading them darker as 
they become longer* These variations can be combined in any order to calibrate 
how much distortion thtre will be in the impression conveyed* 
Selective Reporting 

Many charts and g^phs are idealizations, omitting details that are con- 
sidered unnecessary for the intended purposes* This principle can be carried 
to extremes, however, as is illustrated in Figure 4*19* In the left panel is 
graphed the complete set of data, revealing an inverted □ shaped function over 
time; in the right panel is graphed only part of that function, revealing an 
increasing trend* if the rightmost graph is correctly labeled as presenting 
data up to only a specified time, it is literally correct* But as an "id ^li- 
zed" representation of the trend, it is misleading, since the reader will most 
likely interpret the abscissa of the graph as denoting a representative inter- 
val taken from the scale of interest, and hence will falsely conclude that one 
variable increases with the other in the general case* 

INSERT FIGURE 4*19 HERE 
2* Contextual Compatibility 

Most graphic displays occur in some context, either in t^xt or as part of 
a discussion* Depending on how the material being graphed is conceptualized 
prior to seeing the graph, a given display may be more or less appropriate* 
The message here is simple: a spontaneous description of a chart or graph 
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should not conflict with the description generated on the basis of contextual 
factors* 

a* Compatible Inferences 

The connotations of the written or spoken ^ ^kc L >und should be compatible 
with those of the display* For example, if the text dtates that "The price of 
gold soared to $1040 an ounce", the axes of the graph should be constructed 
such that the function seems to soar* Compare the two graphs in Figure 4*20; 
which is most compatible with the foregoing statement? 

INSERT FIGURE 4*20 HERE 

b* Compatible Terminology 

The labels in a graphic display should not use different terminology than 
is used in the text* 
c* Compatible Discourse 

A graph should not present more or less information than is required for 
its specific purpose* More information will distract or confuse, and less 
information will defeat the purpose of- the display* 

Thus, we vjj have considered all of the principles gleaned from the 
psychological literature and generated via our analytic scheme* In the next 
chapter we will use the scheme to discover the most common kinds of problems 
with graphic displays* 
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CHAPTER 5: USING THE ANALYTIC SCHEME: A SHORT FORM AND &CME RESULTS 



Our diagnostic scheme as presented in Chapter 2 performed as promised: i 

reveals problems with charts and graphs in a systematic and well-motivated way 

s 

Furthermore/ because of its exhaustiveness and attention to detail it helped 
to induce many of the operating principles discussed thus far* However, 
although its thoroughness was necessary in the beginning, this characteristic 
becomes a serious impediment to using the analytic schema on a routine basis* 
Clearly a shorter and more directed form of diagnosLic instrument is required* 
This chapter presents a new version which- takes the form of a questionnaire* 
In the following pages we discuss the development of this new version; we high 
light its advantages in terms of usefulness to the general graphic practi- 
tioner, demonstrate its application to the two graphs which were used to intro 
duce Chapter 1, and finally discuss the results of applying the questionnaire 
to a substantial and representative sample of charts and graphs* 
Development of the Questionnaire 

The original scheme is exasperatingly long and must be applied by someone 
who is thoroughly familiar with the theory developed in the foregoing chapters 
In addition, it can only be applied to a graph which is already in existence 
and only through repeated use on many graphs can one become familiar with the 
more likely kinds of violations of operating principles* Thus/ the original 
scheme is neither a practical way of analyzing charts and graphs nor is it a 
very useful tool for teaching one to become a better graph designer* 

The motivation for developing a questionnaire format was threefold* 
First, this fortait reduces thr? amount of work, in general, and is easier to 
use — especially for unpracticed people* One need only attend to the particula: 
areas addressed rather than to construct a complete description of the graph/ 
as was neasured by the old schema. Second/ the quastionnoire offers a conven- 
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ient means of summarizing our experience in critiqueing graphs for the reader* 
The questions it contains were formulated after carefully reviewing many charts 
and graphs and identifying the most likely ways that particular graphic 
constituents (i.e., frame, specifier, etc*) will violate each operating princi- 
ple* The questions is therefore a convenient , of passing along the 
benefits of our experience to the reader and, as we shall soon demonstrate, 
also provides an effective tool for troubleshooting existing charts and graphs* 
Third, since the questions comprising the questionnaire summarize the more 
common ways that a graph maker can go astray, this instrument itself can serve 
as a learning aid* 

Because the sample of charts and graphs that were used as test casas 
played such a central role in the development of the questionnaire, the sam- 
pling plan by which graphs were selected merits some discussion* The sample 
had to serve two purposes* First it served as the basis for developing a 
questionnaire that was applicable to a wide variety of charts and graphs* 
Second, application of the final version of the questionnaire to a subset of 

i 

this sample was to yield information on the incidence of operating principle 
violations of various types in different broad categories of displays* In 
order to acconplish these goals the sampling scheme first had to be completely 
independent of our operating principles to insure that further development of 
these principles was not biased by the sample per se* Second, the sample had 
adequately to reflect the diversity of charts and graph- *ith which a reader 
may come in contact* 

INSERT FIGURE 5*1 HERE 
The sampling plan is shown in Figure 5*1* Note that this plan considers 
four basic aspects of charts and graphs* *phe first of these is t^e general 
field or content area of die publication that contained the graph* We included 
six broad categories of content area; math, physical science, life science, 
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social sciences, business and "General Interest", which is a cacch-all category 
containing such items as magazines* newspapers and "How to" books. The second 
aspect considered by the sampling plan is the age level of the intended reader- 
ship. This includes three mutually exclusive categories: pre-secondary, 
secondary, and adult* The third aspect considers *:ne general format of the 
publication m which the graph appears* For adults we include journals, text 
books and general reading, whereas for younger readers, the source format is 
restricted to text books and general reading* The fourth and final aspect 
considered by the plan is the visual format of the display* The four cate- 
gories of visual format are bar graphs, line graphs, pie graphs, and other 
graphs* Charts of any type were classified in the other graph category* 

This sampling plan yielded 152 cells* Our initial intent was to find 
several graphs for each of these cells, however, after collecting over 300 
graphs we were left with 77 empty cells* Most of these occur in the non-text 
book source format and in the nonadult age categories* After a heroic effort 
to fill the empty cells we concluded that charts and graphs were very infre- 
quently used in these situations* 

In using the sample for questionnaire development we first perused the 
entire sample and selected about twenty of the seemingly most problematical 
graphs* We then proceeded to' isolate the particular operating principle viola- 
tions in each graph, noting the graphic constitutents involved and the manner 
in which the vio?ation occurred* These twenty "bad" graphs provided the 
foundation on which the original set of questions were framed* We continued 
to analyze more graphs selected from other cells of our sampling scheme to 
ensure wide applicability of the final questionnaire. In all/ over ninety 
graphs were analyzed during the development of the questionnaire* As we dis- 
covered new ways in which operating principles were violated we addsd new, 
questionc or rephrased existing questions so that they would draw attention to 
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violations if they exist* Ifte questionnaire development was thus an iterative 
process of continually fine-tuning questions* making some more general/ others 
more specific, until there was some assurance that most violatiofts would be 
caught* There obviously is still room for improvement, however, *s we demon- 
strate in the next section of this chapter, the current version juj an exce* , ^nt 
compromise in terms of problem areas detected ^pr the amount of time and effort 
expended by the analyst* 

/ 

/ 

Questionnaire Structure 

The questionnaire is presented in the following section* It consists of 
two modules* The first and larger of these is concerned with single framework 
graphs or the individual subgraphs of a multiple framework display* The second 
module contains questions on the relationships between the various subgraphs of 
a multiple framework display* Each module is organized into three parts* The 
first contains questions pertaining to operating principles at the syntactic 
level* Ifte second contains questions at the formal and semantic levels, and the 
third is concerned with pragmatic operating principles* Table 5*1 shows the 
number of questions in the questionnaire which pertain to each operating 
principle* 

INSERT TABLE 5*1 HERE 
Each of these divisions into levels of analysis is further organized by 
graphic constituent and combinations thereof* At the syntactic level each con- 
stitutent may be treated in isolation since syntax is concerned only with the 
processing of marks on the page* At the formal, semantic and pragmatic levels, 
however, the meanings and implications of constituents are determined by rela- 
tionj between them (i*e*, how does the specifier operate in conjunction with the 
framework) as well as by the constituents thtrcselves* Table 5*2 shows the 
number of questions in module 1 dealing with each constituents or combinations 
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of constituents* Module 2 contains ten questions, all of which deal with the 

\ 

relationship between subgraphs in multiple framework display^ 

INSERT TABLE 5*2 HERE \ 

For the most n-rt the structure described above is maintained throughout 
the questionnaire, . owever* there are one or two instances where a question 
concerning a principle at one level is placed among questions at a different 
level of analysis* An example of this is question 49 w,uch concerns the syntac- 
tic principle of perceptual distortion* This question is included in the seman- 
tics section of the questionnaire because the reader is not likely to realize 
that this principle has been violated until he or she tries to assign meaning to 
the specifier* Perceptual distortion is truly a syntactic issue, however, since 
it is concerned with how the perceptual system processes a visual form* 

in module 1 there are three spaces' provided after each question* These are 
to be used to record independent responses for as many as three subgraphs in a 
multiple framework display* Of course only one space is needed for a sir\gle 
framework graph* 

Each question in the questionnaire offers several alternative responses 
each of which falls into one of these categories* Some responses imply that the 
graph has no problem^with regard to the issue addressed by the question* Other 
responses imply that the operating principle involved has been violated to a 
minor degree* These violations may be purely technical, causing np real impedi- 
ment to the chart or graph, or they may result in a minor initial confusion on 

* 

the part of the graph reader* Responses in this category are referred to as 
violations and are indicated by a single asterisk (*)* Anally, the third cate- 
gory of responses corresponds to severe '** lations of an operating principle 
which either cause a jreat deal of confusion before being resolved or render 
some facet of the graph completely uninterpretable * Responses of this type art; 
referred to as tauj.ts and are indicated on the questionnaire by a double 
asterisk (**)* 




A Tool for the Analysis of Charts and Graphs 

The following ?stionnaire can be used with either charts or graphs* How- 
ever, many ^f the questions are inappropriate for charts; these questions are 

I 

preceded by the synbol, (@), in addition, because graphs are so much more f v '- 
quently used, we have used some terminology that is specific to graphs, in parti- 
cular we have referred to frameworks as being composed of axes* In a graph, the 
framework defines the domain and range that are related together by a specifier* 
This usually is an independent variable (things being varied, such as time) and 
a dependent variable (such as tons cf wheat), with a line or bars serving as a 
function relating the two* In a chart, the framework is usually broken down into 
a set of boxes (as in a flowchart) or nodes (as in a family tree), and the speci- 
fier is composed of a sec of lines that relate these framework elements together 
in some way (j.n a linear sequence, hierarchically, etc*)* Tftus, when analyzing a 
chart, simply substitute "box" or "node", as appropriate, when reading "axis"* 
The same principles apply in the sam^ way to both charts and graphs* 

The reader should approach hhis questionnaire with a major caveat in mind: 
we can reveal violations of principles, but we cannot tell you whether these 
violation are important* That depends on the purposes to which the chart 01 

► 

graph is being put* Thus, after tallying up the sum of the violations, one must 
carefully consider the purpose of the display* Is it to express an idealization 
of some structure or relationship? If so, then great precision in internal map-' 
ping may not be required* Is it to display actual data precisely? If so, then 
virtually all violations will be important. Is it to lead the reader toward 
particular point of view? If so, then one may decide to violate the principle o£ 
invited inference. And so on* 

The vjtfe^ions ^n each of the two modules (single display, multiple display) 
aro numbered, *nd each beats a thre- letter cede which associates -ho question 
with sp*c\£ic level of analysis a nd operating principle* h list of the ptuim- 
pies <inj cod<?s is oiven oa the. n rt xt page* 
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Operatina Principles and 
Associated Codes 



Syntactic Principles 

S-AT adequate discriminai ' 

S-PL perceptual distortio. 

S-GO gestalt organisation 

S-OS dimensional structure 

S-PP processing priorities 

S-PL processing limitations 



Semantic Principl es 

M-SC Surface Compatibi lit^ 

M-SA Schema Availability 

Formal Principles 

F-IH internal mapping — / 

F-EX external mapping 



Pragmatic Principles 

. s 

P-II invited inference 

P-CX context 
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Module Is Analysis of a single display 
Syntax 



Syntax 



QUES- PRIN- 

TlOUn CIPLE Stl I i>[2) St3) 



J 

Outer Framework 



S-AD Are the marks defining or implying the outer frame- 
work sufficiently discriminable such that the function 
of this constituent is recognizable? 

1 yes 

2 a brief search i£ required for recognition {*) 

3 the function of the framework is very difficult to 
apprehend (**) * 



S-GO Are Gestalt factors applied to the marks which define 
or imply the outer framework such that the correct 
orqanizatioa is easily perceived? 

1 yes 

2 a brief search is required for recognition (*) 

3 framework is very difficult to perceive {**) 



@ 3 S-PL If hash marks a"** used to subdivide intervals between 
scale value labels along the axes* is the number of 
marks snail enough so that it can be apprehended at 
a glance? 

0 not applicable 

1 yes 

2 no, some thought is required (*) 

3 no, a great deal of thought is required {**) 
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QUES- PRIM- 

TIONB CIPLZ S[l] St 2 J S[3) 



Inner Framework 



If there is inner framework, go to question 7* 



@ 4 S-AL Are the marks defining or implying the inner frame- 
work sufficiently di^crinunakle such that this 
constituent is recognisable? 

' 0 not applicable 

1 yes 

2 a, brief search is required for recognition (*) 

3 framework is very difficult to perceive {**) 



S-GO Is the organization of marks defining or implying 
implying the inner framework sufficiently clear 
such that the constitutent is recognizabj ? 

0 not applicable 

1 yes 

2 a brief search is required for recognition {*) 

3 framework is very difficult to perceive (**) 



S~GQ is the relation between the inner and outer frame- 
work clear? 

1 yes ^ 

2 yes, but some thought is required (*) 

3 no (**) 



-P? Are the more visually^ salient features of either the 
inner or outer framework more inportant than the 
less salient feature-*? 

1 yrcs 

2 no {*) 



Syn tax 



QUES- PRlN- 

TIGHS CIPLE S(1j Sl2) St Z] 



Background 



8 S-PP If there is a background/ are the background figures 
or designs too dominant such that they obscure the 
presentation of information? (If there is no back- 
ground, go to question 10*) 

1 no 

2 yes, background figures hinder the presentation 
- sonewhat (*) 

3 yes, background figures severely hinder the 
presentation (** ) 



S-PL Does the number or complexity of background figures 
tax processing, leading to confusion? 

1 no * 

2 yes {**) 
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Syntax 



QUES- PRIU- 

TX0N4 CIPLE S [ 1 ] S[2) S[3) 



Specifiers 



10 S-hD Please fpply the following categories to describe 
the case of discriminability of those visual 
continua, listed below, which are applicable to 
this chart or graph* 

0 not applicable 

1 levels or variations aire easily discriminable 

2 some difficulty in discriminating (M 

3 much difficulty in discriminating (**) 

A shape 

B length 

C size 

D position 

B orientation 

F continuity 

G lightness (achromatic ) 

H lightness (chromatic) 

I hue 

J saturation 

K numerousness, 

L other 



11 S-GO Do the Gestalt principles (similarity, good form, * 
symmetry, proximity and good continuation) imply 
conflicting organization of specifiers? (i*e*/ 
are any Gestalt factors displayed such that the 
correct organization is de-emphasized?) 

1 no 

2 yes, a slight conflict exists which can easily 
be resolved (M 

3 yes, a severe conflict exists which cannot easily 
be resolved {**) 
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Syntax 



TIOHff CIPLE St 1 1 Si 2) St 3] 



Specifiers (Cont'd) 



@ 12 S-DS Is information concerning two distinct semantic 
variables being conveyed by tvo integral visual 
dimensions (e.g. oil production and population 
of a country conveyed respectively by the height 
and width of a rectangle)? 

1 no 

2 yes, but only a minor difficulty results (*) 

3 yes, and it is very difficult to extract infor- 
mation (**)- 



@ 13 S-PL If parts of specifiers are separately identified 
(^.g* each bar has several distinct and abutting 
segments), is the reader required to compare parts 
in order to extract information from the graph? 

0 not applicable 

1 no 

2 yes, however the information conveyed by these 
parts is of secondary importance (*) 

3 yes, and the information encoded in these parts 
is of major importance (**) 



14 S-PP If a specifier is used as a depiction, such that 

variations in si2e> orientation, etc. are important, 
are these variations emphasized so that they are 
encoded? 

0 not applicable 

1 yes, differences are very discriminate 

2 no, differences are only moderately dis- 
criminable (*) 

3 no, differences are not discrinunable (**) 
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Syntax 



TION* CIPLB stl) St2) St3] 



Specifiers (Cont'd) 



15 S-ad Is any local region of the graph area so densely 
packed that it is difficult to identify and 
interpret individual specifiers? 

1 no 

? yes, some difficulty results from local 
crowding {*) 

3 yes, a great deal of difficulty results from 
local crowding (**) 



16 S-PL Is the whole graph area so densely packed with in- 
formation that one cannot* understand the information 
presented? 

t no 

2 yes, some difficulty results from global 
crowding (*) 

3 yes, a great deal of difficulty results from 
global crowding {**) 



W 

17 S-PP Co all visually salient features of the specifiers 
bear information that 5^ pertinent to the intended 
message of the graph? 

1 yes 

2 no, however this results in only a minor 
processing difficulty (*) 

3 no, and this results in a severe processing 
difficulty (**) 



18 S-PP is the visual dominance of the elements consistent 
with the points to be made? (eig*, are the more 
salient features more important?) 

1 no inconsistency 

2 minor inconsistency (*) 

3 severe in^on^istt-sncy ( **) 
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Syntax 



QUES- PRIH- 

TI0H8 CIPLE St 1 1 S[2} S[3J 



labels 



19 F-EX Is there a title? 

1 yes 

2 no {*) 



If there is no title, go to question 22* 



20 S-PP is the title easily recognizable by virtue of 
its size and position? 

1 yes 

2 no, some search is required (*) 

3 no, very difficult to recognize (**) 



21 S-AD is the title legible? 

1 yes 

2 no, some effort is required (*) 

3 no, a great deal of effort is required {**) 



22 S-PP Is the subtitle easily recognisable by 
virtue of their size and position? (if 
there is no subtitle, go to question 24*) 

1 yes 

2 no, some search is required (*) 

3 no, very difficult to recognize ( ) 



2Z S-;o is the subtitle legible? 

1 yes 

2 no, <?ome effort is required (*) 

3 no, a great deal of effort is required (**) 



ERIC 



184 

18, 



Syntax 



CUES- PRItl- 

TlONtf CIPLE Sill S(2) SL ) 



Labels (Cont'd) 



24 S-PP Is the caption or legend easily recognizable? 
(If there is no caption or legend, go- to 
question 26*) 

1 yes 

2 no, some search is required (*) 

I no, a great deal of effort is required {**) 



25 s-AD is the caption or legend legible? 

1 yes 

2 no, some effort is required (*) 

I no, a great deal of effort is required {**) 



26 S-CO Do Gestait principles seem to associate variable 
names with the appropriate axis? {if there is no 
variable name on axes, go to question 28*) 

0 not applicable 

1 yes 

2 no, some attention is required (*) 

3 no, a great 4eai of attention is required (**) 



27 S-ad Are variable names legible? 

0 not applicable 

1 yes 

2 no, some effort is required (*) 

3 no, a great deal of effort is required (**) 



Syntax 



QUE5- PRin- / 

TIOMi* CIPLE Sfll St2l s(3l 

Labels (Cont'd) 



@ 28 S"GO Are scale values easily associated vith corres- 
ponding tick marks along the appropriate axis? 
(If there are no scale values, 90 on to 
question 30* ) 

1 yes 

2 no, some attention is required (*) 

3 no, a great deal of attention is required (**) 



@ 29 s-AO Are scale values legible? 

1 yes 

2 no, some effort is required (*) 

3 no, a great deal of effort is required (**) 



30 s-GO Are the units easily associated tfith the appro- 
priate scale values? (if unit of scale values are 
not marked, go to question 32*) 

1 yes 

(v-4l 2 no, some effort is required {*) 

3 no, a great deal of effort is required (**) 



G 31 S-AD if units are marked/ are the marKs legible? 

1 yes 

2 no, some effort is required (*) 

^ njt a great deal of effort is required (**) 
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QUES- PRltJ- 

TIGtltf CIPLE StlJ St2] SC 33 



Labels (Cont'd) 



32 S-GO Are variable Q r level names readily associated 
with the appropriate specifiers? 

0 not applicable / 

1 yes 

2 no, some attention is required (*) 

3 no, a great deal of attention is required (**) 



33 S-AD Are these variable or level names associated with 
specifiers legible? 

0 not applicable 

1 yes 

2 no, some effort is required (*) 

3 no, a great deal of effort is required (**) 



If no legend is used to label specifiers, go to question 35* 



34 S-PL Are too rsany pairs of items present in the legend 

such that it is difficult to remember associations? 

1 no 

2 yes (*) 



35 S-PP Is the visual dominance and form of label elements 
consistent with points to be made? 

1 yes 

2 no, minor inconsistency (*) 

3 no, severe inconsistency (**) 



ERJC 



187 



Semantics 



-QUES- PfclN- 

TICN* CIPLE St 1 ] S(2] S 1 3 i 



Framework 



@ 36 F-EX Do the syntactic cues regarding denseness or differ- 
entiation for any frame elements contradict the 
semantic implications regarding these' qualities? 

1 there is agreement 

2 there is a contradiction C*) 



37 F-EX Is there only one apparent way of interpreting each 
frame element? 

1 yes 

2 no, but after deliberation only one way is 
plausible (*) 

3 no, and there is no way of resolving which 
interpretation is correct (**) 



38 F-EX Is every necessary part of the framework clearly 
implied or present? 

1 yes 

2 not clear (*) 

3 no (**) 



39 M-sC If the framework depicts, is it clearly representative 
of the class for which it stands? 



0 not applicable 

1 yes 

2 no, the depiction is somewhat misleading (*) 

3 no, tht depiction is very misleading {**) 
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Semantics 



QUES- PRIN- 

Tiom CIPLE 4 S[l] S[2] S 1 3j 



Framework (Cont'd* ) 



40 M-SA Is the form of the framework likely to be understood 
by the intended reader? 

1 \ yes 

2 no, probably not (*) 

3 no, certainly not (**) 



*J 41 F-EX Do all parts of the framework £>lay a role in its 

h function? 

1 yes 

2 no, but this does not distract or confuse (*) 

3 no, and this is confusing (**) 



42 M-SA Are the variables associated with each frame element 
likely to be understood by the intended reader (e.g., 
derivatives, integrals, or other higher math concepts 
or technical terms)? 

1 yes 

2 no, probably not (*) 

3 no, certainly not (**) 



@ 43 F-EX Are any axes discontinuous or nort-uniform? 

0 not applicable 

1 no 

2 yes, in art obvious way 

3 yes, but nor in an obvious 'Jay (*) 
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Semantics 



TX0H3 CIPLC S[1) S(2) $iy 



Background 



44 F-EX If background figures are present, are they easily 
interpreted as such, or are they confused with 
specifier elements? 

0 not applicable 

• 1 yes ' 

2 no, some confusion occurs {*) 

3 no, a great deal of confusion occurs {**) 



1 90 ^ q 



Semantics 



T'.ON* CIPLC St 1 3 S(2) S E 3 ) 



Specifiers 



45 If specifiers depict, is depiction representative of 

class for which it stands? 

0 not applicable 

1 yes 

2 no, the depiction is somewhat misleading {*) 

3 no, the depiction is very misleading (**) 



46 M-SC If specifiers represent symbolically, are represen- 
tations easily connected to their refferen'.s? 

0 not applicable 

1 1<3S 

2 r t o, the association is somewhat counter intuitive (*) 

3 no, the association is very counter intuitive (**) 



\ . 

47 F-EX Is there only apparent way of interpreting each 
specifier? (For instance, is it clear wheu.er 
specifiers are contiguous or overlapping?) 

1 yes 

2 no, but after deliberation only one way is 
[V-9J plausible (*) 

3 no, and there is no way of resolving which 
interpretation is correct (**) 



40 F-IN Is three dimensional perspective used in a way such 
that some specifiers (or parts thereof) are altered 
in shape or sizti? 

1 no 

2 yes relative comparisons of sp^if iers are 
somewhat non veridical (*) 

3 yes* relative comparison of specifiers is very 
non veridical and/or mappings are obscured (*M 
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Semantics 



TlOm CIPLZ Stl) Sf^ 1 S( 3 J 



Specifiers (Cont'd) 

Q 49 S-PD Ace the specifiers presented in a way that allows 
the reader to make subjective quantitative compar- 
isons of elements based on visual inspection which 
are in accord with the actual quantitative 
relationships? 

1 yes - - 

2 no, subjective estimates are systematically off 
by a small amount {*) 

3 not subjective estimates are systematically off 
by a large amount (**) 



50 F-EX Are symbols representing different items differen- 
tiable? 

0 not applicable 

1 yes 

2 no, some attention is required {*) 

3 no, it is very difficult to apprehend 
differences ( ** ) 



51 F-EX Is every meaningful difference indicated clearly by 
a difference in marks? 

1 yes 



52 M-SC hue the visual continua along which specifiers vary 
compatible with information displayed? 

1 yes 

2 no, slightly incompatible (*) 

3 no, completely incompatible (* * ) 



ERJC 



192 



19o 



^OTTKintlCS 



QUES- PRIN- 

Tiom crpr S 



Shi S(2] S(3] 



Specifiers (Cont'd) 



53 H-SC Is the spontaneous interpretation of the specifier 
conpatible with the cognitive construct being 
represented? 

1 yes 

2 no, slightly incorcpatible'(*> 

3 no, conplet^ly incompatible (**) 
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Semantics 



9 



QUES- PRIN- 

TIOTW CIPLR Stll S[2I $t3J 



Labels 

If any of the following types of labels are absent or incomplete, 
assess the impact on the interpretaoility of the graph* (in each 
case, please use one of the following responses*) 

0 not applicable 

1 easily interpretable 

2 eventually internretable (*) 

3 uiii^terpre table ( ** ) 

54 F-EX Variable label on axes* 

55 F-EX Scale values on axes* 

56 F-EX Units of scale values* 

57 F-EX Labels on specifiers. 
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M-$C If depictions serve as labels, are they clearly repre- 
sentative of the class of objects for which they stand? 

0 not applicable 

1 yes 

2 no, the depiction is somewhat misleading (*) 

3 no, the depiction is very trisleading (**) 



59 M-SA Are the words used in labels clear and conpre- 
hensible to nhe intended reader? 

1 y*s 

2 no, probably not (*) 

3 no, certainly not (**) 



60 M-SA Are symbols used in labels familiar to the intended 
reador? 



0 not applicable 

1 yes 

I no, probably not (*) 

3 no, certainly not (**) 
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QUES- PRIN- 

TION# CIPLE St 1 ) S(2l St 3) 



Labels (Conf , ) 



6l F-Il] if specifiers are labelled in a legend or caption, 
is the correspondence between iteps on the tfratfh 
and those in the legend clear? 



0 not applicable 

1 yes 

2 ninor inconsistency (*) 

3 severe inconsistency (**) 
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Se nuntLcs 



QUES-* PRIM- 

TIOMf CIPLB Stl) S(2] S(3] 



Framework x Specifiers 



62 F^IN Is it clear which points of which axes are being 
related by each specifier? 

0 not applicable 

1 yes 

2 no, it is not obvious immediately but can even- 
tually be resolved (M 

3 no, and the correct correspondence cannot be 
deternined (**) 



@ 63 F-IH If any specifiers extend outside the region bounded 
by the frame elements, is there a consequent loss 
of precision in assigning quantitative values? 

0 not applicable 

1 no 

2 yes, but the loss o£ precision is not important 
for most purposes (*) 

3 yes, and the loss of precision severly hampers the 
use of the graph for its intended purpose (**) 



64 F-iU Is the level of precision of the seal* markings and 

hash marks along the frame compatible with the mappii^j 
precision afforded by the specifiers? 

1 yes 

2 no (*) 
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Semantics 



QUES- 
TION* 



PRIN- 
CIPLE 



S(1) St2] S(3J 



Framework x Specifiers x Labels 



65 F-EX Is there an apparent and logically consistent literal 
interpretation for each visually salient feature in 
the graph? 

1 yes 

2 no, minor confusion results (*) 

3 no, a great deal of conLusion results (**) 



66 M-3C Are the various graphic elements and visual proper- 
ties used in a way that is consistent with cultural 
conventions? (e.g*, red implies danger, green implies 
safe* If symbols have an accepted meaning is their 
use consistent with this meaning? [{/) implies okay, 
(x) implies incorrect!) 

0 not applicable 

1 yes 

2 no, cultural conventions have been ignored but not 
violated {*) 

3 no, the use of graphic elements and visual proper- 
ties is blatantly inconsistent with cultural con- 
ventions (**) 
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Pragmatics 

Pragmatics 



QUl^S- PRItl- 

TIONfl CIPLE St 1 1 S[2J St 51 



Framework 



£ 67 P-II Are scale units, aspect, or use of truncation of 
axes proper for the impression the illustrator 
wishes to convey? 



1 yes 

2 no (**) 



68 P-II If the framework depicts, does it convey a message 

consistent with the point of the graph? ^ 

^ 0 not applicable 

1 yes 

2 no, a slight contradiction is implied (*) 

3 no, a severe contradiction is implied (**) 



ERLC 



198 



201 



Pragmatics 



QUES- PRIN- 

TlOr^ CIPLE S(l] S[2) St3J 



„BacJc<f round 



69 P^II Do background figures, if present, convey a message 
consistent with the point of the graph? 

0 not applicable 

1 yes 

1 no, a slight contradiction is implied (*) 

3 no, a severe contradiction is implied (**) 



• 
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Pragmatics 



£UES- PRIN- 

TION* CIPLE Stl) SU 1 *13) 



Specifiers 



70 P-II Do the specifiers make or imply inferences that 
are contradictory to messages conveyed elsewhere 
in the chart or graph? 

\ no 

2 yes, slight contradiction* (* ) 

3 yes, glaring contradiction (**> 
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Pragmatics 



OUES- PRin- 

TlOtttf CIPLC S(1 ] Sl2) St3] 



Labels 



71 P-CX Is the graph suitably introduced by: 

1 title or subtitles 

2 comments 

3 caption 

4 no, it is not suitably introduced (**) 
(note: more than one answer is possible) 



72 P-II Do connotations of labels agree with the visual 
impact of the display? 

1 yes 

2 no {*) 



73 P-CX Are labels consistent with the terminology used 
in the text? 

0 not applicable 

1 yes 

2 no (*) 
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Pragma tics 



QUES- PRIN^ 
TIGNtf CIPLE 



S[1] St2] S[3) 



Framework x Specifiers x Labels 



74 P-CX Is the information presented in the chart or graph 
compatible with the adjacent text? 

0 not applicable 

* 1 yes 

2 no, the two are slightly inconsistent (*) 

3 no, the two are very inconsistent (**) 



75 P-CX Are the invited inferences compatible with the 
information presented in the adjacent text? 

0 not applicable 

1 yes 

2 no (*) 



• 
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Module 2t Organization of Subgraphs 



Syntax 



£UES- PRIM- 
CIPLE 



76 S-PP 



77 S-PP 



78 S-PL 



79 S-GO 



Does the physical arrangement of subgraphs lead the 
reader to examine them in a logical sequence? 

\ yes 

2 no, the arrangement is suboptimal (*) 

3 no, the arrangement is vejry confusing (**) 



Does relative visual saliency of subgraphs corres~ 
pond to the relative importance of the information 
presented in each display? 

\ yes 

2 no, but only a moderate problem (*) 

3 no, leading to confusion {**) 



Are there too many subgraphs to comprehend at once? 



2 yes (*) 



Do the Gestalt factors lead one to make the 
appropriate associat ons between items in a 
legend and their referrents in the various 
subgraphs? 

0 not applicable 

1 yes 

2 not some confusion exists {*) 

3 no, a graat deal of confusion exists (**) 
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Semantics 



QUES- PRIM- 

TI0M4 CIPLE S tl 1 S [2l si3l 



80 F-IN If there are instances where the same variable is 
discussed in several subgraphs, is the physical 
arrangement conducive to comparisons at equal 
values for all shared variables? 

0 not applicable 

1 yes 

2 no, however, in the context of the presentation 
such comparisons are not relevant 

3 no, and the usefulness of the graph is slightly 
impaired (*) 

4 no, and the usefulness of the graph is severely 
impaired (**) 



@ 81 F-IN If there are instances where the same variable is 

discussed in several subgraphs, are the axis scales 
for the shared variables marked iri the same units 
per inch? 

0 not applicable 

1 yes 

2 no, however, this is in the best interest of 
communicating the information 

3 no, and the usefulness of the graph is slightly 
impaired <*) 

4 no, and the usefulness of the graph is severely 
impaired (**) 



82 F-IN If one subgraph presents a second view of the 

information in another subgraph, is the correspon- 
dence between the two subgraphs clear? 

0 not applicable 

1 yes 

I no, some inspection is required (*) 

3 no correspondence is evident (**) 
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Semantics 



QUES- PRIN- 
TIOH# CIPLE 



S[1] S(2] Sl3] 



83 F-EX 

^ 84 F-EX 

85 F-IN 



Are the title, comments, and other labels, in con- 
junction with the graphic material, sufficient to 
explain the relationship between the various 
subgraphs? 

1 yes 

2 no, soma aspects remain unclear (*) 

3 no, the overall relationship between subgraphs 
remains uoclear (**) 



If lines or other marks are used to relate subgraphs, 
is it clear how each of them functions? 

0 not applicable 

1 yes 

2 some doubt exists (*) 

3 probably not (**) 



If the different subgraphs employ constituents of 
different forms serving the same purpose, does this 
increase the workload on the reader? 

0 not applicable 

1 no 

2 moderately {*) 

3 yes {**) 
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Questionnaire Feliability 

The success of our questionnaire as an evaluative aid hinges on its 
reliability* Reliability, as the term is used here, concerns the degree co 
which diffetent analysts using the questionnaire to analyze a given chart or 
gi t agree in their analysis* Obviously the questionnaire would be worthless 
if each independent person using it to evaluate the same graph produced a 
different set of violations* 

The way i*\ which we assessed the reliability of the questionnaire is best 
illustrated by considering the possible outcomes of an analysis of a chart J or 
graph by two different analysts* The possible outcomes are presented in Table 
5*3* 

INSERT TABLE 5*3 HERE 
One outcome has analyst 1 scoring a problem with respect to a particular 
operating principle while analyst 2 scores no such problem* A second outcome 
has both analysts agreeing that no problem occurred* A third outcome has 
analyst 1 scoring no problem while analyst 2 scores a problem* Finally* the 
fourth outcorce has both analysts agreeing that a problem has occurred* If we 
divide the total number of agreements (a+d) by the total number of questions on 
which agreements were possible (a+b+c+d), we get an agreement rate, r, 

a+d 

r% = x 100 (1 ) 

a+b+c+d 

Mote that this rate can vary from 0% (perfect disagreement) to 100% (perfect 
agreement) * 

The overall agreement rate for our questionnaire was determined by aggre- 
gating the possible outcomes of two analysts who independently evaluated ten 
separate graphs (four of which were multiple framework) randonly selected from 
thtr sample* Both violations (*) and faults (**) wers classed as equivalent and 
formed a single category called problems* This category w<is then contrasted 
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• with the category called no problems* One analyst was very experienced with 
the scheme and the other analyst was naive at the outset. Because there are 75 
questions in nodule 1 (dealing with single frameworJc graphs! and 10 additional 

9 questions in module 2 (dealing with multiple framework graphs) there is a total 

of 790 questions ^m which agreement and disagreement is based, Tht oial 
values are presented in Table 5.4* From this table, the rate is 06*58 percent 

^ indicating a fairly strong agreement between the two analysts on what consti- 

tuted a problem in this group of graphs* 

INSERT TABLE 5*4 HERE 

^ A closer examination of the data reveals a greater than 97 percent agree- 

ment rate between the analysts on seven of the ten graphs* Of the remaining 
graphs, only one fell belcw the 90 percent rate, a low of S3 percent* An 

^ examination of the data associated with this worst case reveals more than half 

the disagreements between the analysts concerned the formal principle of exter- 
nal mapping* This is not surprising since overall/ ignoring the particular 

9 violation cited, and simply noting whetKefr there is agreement about a syntac- 

tic, semantic, formal or pragmatic violation, the formal analysis yields the 
lowest agreement rate, a 94*39 percent rate* This is to be contrasted with the 

• 95*45 percent rate for the closely aligned semantic principle, and 97*00 per* 
cent and 93*09 percent agreement rates for the pragmatic and syntactic princi- 
ples respectively* Similarly, if we simply rate whether the analyst spotted a 

# problem with the framework, specifier, labels or background there was high 
agreement on localizing problems to these basic cons titutents with rates rang- 
ing from a low of 95*78 percent for the specifier to a high of 97*65 percent 

# for the framework* 

The upshot of these analyses is straightforward; even a naive analyst is 
able to use the questionnaire to reveal basic problems wj^jth a graphic display 
9 and the instrument is quite reliable* It is worth notirg in addition that the 
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analysts almost always saw the same problems in a display, but differed in the 
way th - conceptualised (and hence categorized) the problems* The bases for 
such differences will be developed in the following chapter* 

Application of the Questionnai " :> a Re L tentative Sample of Charts and 
Graphs 

In the previous sections we have shown the questionnaire to be reliable in 
that two analysts independently discovered essentially the same violations in a 
set of ten graphs* i n this section we report the results of applying the 
questionnaire to a substantial subset of the charts and graphs which were • 
collected according to our sampling scheme* The purpose of this effort was 
two-fold: First it served as a final test of the questionnaire on a diverse 
set of graphs takon from commonly encountered reading material, and second, it 
provides a description of patterns of operating principle violations in various 
categories of charts and graphs* 

The sampling scheme was described in detail in a previous section* We 
randomly selected one graph from each of the 75 non-empty cells of the sampling 
scheme (see Figure 5*1 ) and divided these between two analysts who then applied 
the questionnaire to each graph* 

The reader can get a detailed view of how the non-empty cells of the 
sampling scheme are distributed with respect to chart and graph categories in 
Figure 5*1, however, Table 5*5 summarizes the gross features of the distribu- 
tion* Note from the table that the numbers of graphs analyzed were fairly 
evenly distributed amongst the content areas* Most of these graphs^ though, 
were found Ln adult journals and textbooks indicating a dearth of this material 
in publications for children* Similarly, the distribution of formats corres- 
pond to the predominance of bar charts and line graphs found in the litera- 
ture* 
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INSERT TABLE 5.5 HERE 
The distribution of the number of faults or serious violations per graph 
for each of these categories is summarized In Table 5.6„ From this table, the 
distribution of faults within a category follow, the independence model for all 
categories with the exception of content ai ■ t . For this category, graphs found 
in the business area result in a much higher number of faults than graphs found 
in other areas. This observation is confirmed by a reliable chi-square, x^^^ 
- 13.56, p<*02. A closer examination of the data for this case reveals that 
these graphs are not as well executed as graphs found in the other content 
areas. That is 41.1 percent of all faults concerning principles pertinent to 
organization (e.g., similarity, proximity, etc.) and 38.9 percent of all faults 
concerning principles pertinent to seeing the lines (e.g.* discriminabili ty) 
occurred in this content area. Additionally the graphs sampled from this con- 
tent area employ symbolic representations that are not as compatible with thair 
reference nor as consistent with the conventions of our culture as graphs found 
in other areas. This is evidenced by the fact that of all the faults concern- 
ing the pr nciple of surface compatibility* 62.5 percent were found in graphs 
taken from the business area* 

INSERT TABLE 5.6 HERE 
The distribution of faults per question set as a function of the different 
levels of description and the distribution of faults per question set for each 
of the different operating principles are shown in Tables 5.7a and 5.7b respec- 
tively, while neither table is consistent with the independence hypothesis, 
some comments are in order. Note that faults pertaining to formal principles t 
specifically the principle of external mapping occur most frequently. Recall 
that this principle pertains to the meaning given the marks and is violated if 
a nark is ambiguous or a necessary set of marks is missinq. 
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The discussion in Chapter 4 lists some of the pitfalls to be avoided • Addi- 
tionally, the use of a title as a descriptive aid will help orient the reader 
and thus reduce potential ambiguity, 

INSERT TABLE 5*7 HERE 

Also note from Table 5.7b that another area in neea of impro^ ment in- 
volves the organisation of the marks. Most often carelessness is the culprit 
in this case. A careful analysis of the graph upon completion should help 
reduce violations of this principle. 

Table 5,$ shows the distribution of faults per question set as a function 
of the different graphic constitutents and their combinations. The table shows 
the greatest proportion of faults pertains to the specifier alone and its 
interaction with the framework. The incidence of faults for th s basic level 
constituent is roughly two times greater than the other basic levtls and this 
increased incidence is reliable, x 2 {^)=21.19, p<.01. 

INSERT TABLE 5.8 HERE 

The breakdown of the proportion -of faults, in terms of specific operating 
principles violated, for these two constituents is shown in Table 5.9. From 
this table, the proportion of faults pertaining to the interaction between 
framework and specifier is entirely due to violations of the internal mapping 
principle. Violations, in this case, usually result when the graph maker tries 
to represent two or more range scales on a single framework graph, having 
multiple specifiers. If such a condition is envisaged it may be preferable to 
represent the information using multiple frameworks. With regard to the speci- 
fier, we note, as before, that most of the proportion of faults is accounted 
for by violations of the external mapping and discriminability princples. 

INSERT TABLE S.9 HERE 

In summary, the pattern of results reported in this chapter suggest two 
riroas of improvement in graph design. First, w« must ensure that the interpre- 
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tation of the tjraph is not contaminated by either the addition of too much 

information (such as the careless placement of a second range scale on a single 

framework cjra^h) or the deletion of relevant information (such as a title)* 

■ 

Second, we should exercise care in the e v cution of the graph by ensuring that 
adequate discrimmability and organizati^. result over a wide range of condi- 
tions ana graph readers* 
Two Graphs Seen in a New Ligh t 

At the beginning of Chapter 1 we saw two graphs (Figures 1*1 and 1.2) that 
ware obviously flawed. However, most people could not say exactly how these 
displays were amiss, but had only haphazard intuitions and sketchy diagnoses* 
Let us return to those graphs now, armed with the diagnostic tool just 
presented. 

Figure 1.1: Falling interest rates. 

In using the questionnaire the following violations of Figure 1*1 were 
revealed: 

Syntax : 

The specifier violated two syntactic principles. The principle of ade- 
quate discriminability was violated because the colors of the lrnes were too 
similar, making it almost impossible to tell them apart when they crossed. The 
principle of processing priorities was also violated because the specifier 
.lines differed dramatically in their saliency, but this difference did not 
reflect any difference in the importance of the information being presented. 

The labels violated one syntactic principle in three different places. 
First, the principle of gestalt organization was violated because scale values 
ware not associated with the tick marks/ second, it was violated because units 
were not associated with the scale values/ and third, it was violated because 
variable or l^val names were not associated well with the specifiers (at tho 
far rifjht) * 



2H 



Semantics - 

The framework violated one semantic principle* that of representativeness. 
The framework did not clearly depict a bank, and this depication was* there- 
fore, slightly distracting. 

Fortnal : 

The framework violated the external mapping principle because its markings 
were not consistent with the concept being represented; although this is a 
minor point in the present case* 'the graph would have been useful more gene- 
rally if the axes had been marked into discrete units. 

The specifier violated the internal mapping principle because the fore- 
shortening of the framework resulted in difficulty in comparing the foroi of the 
functions at different places in the graph* ifrat is, the slopes must be 
mentally adjusted to compensate for the distorted framework in order to compare 
slopes at different points along the specifiers* 

The graph as a whole violates the external mapping principle because thore 
is no logically-consistent interpretation for all of the marks* for example, 
why are sotne labels along the axes inside the framework whereas others are 
outside the framework? 

Pragnatics : 

Finally, the pragmatic principle of invited inference was violated because 
of the foreshortening of the framework: although this does succeed in making 
the fall seem steeper, it also makes the rise seem steeper — which is an acci- 
dental byproduct of the attempt at distortion* 
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Figure 1*2: Ecological niches 

This graph was baffling to many people; our system explains why* 
Syntax: 

The specifiers violated four of our syntactic principles: First, th^ 
princi t of dimensional structure was violated because peopl ->ee rectangles — 
and the specifier is in fact two distinct extents (one -horizontal and one 
vertical); because the dimensions of sides of rectangles are integral, the 
value along one extent cannot help but influence how we see the value along the 
other* Second, the principle of adequate discrinunability is violated because 
some of the specifiers are hard to see. Third, the principle of processing 
priorities is violated because the most visually striking specifiers are not 
necessarily the most important* Fourth, the principle of gestalt organization 
is violated because when two of the specifiers overlap, new rectangles are 
formed by the patch of common color* But these rectangles do not represent 
additional specifiers, and hence are very misleading* 

The lables violated two syntactic principles* The title is difficult to 
see, violating the principle of adequate discriminabi lity* The keys are 
divided into two segments, and group via proximity to distinct panels — even 
though all six key elements are relevant to each panel* Thus, the key violates 
the principle of gestalt organisation* 

Formal : 

Finally, this graph violates a formal principle* The external mapping 
principle is violated because there is more than one way of interpreting the 
specifier elements* 

Thus, it is clear that our analytic system not only provides insight into 
what previously was pretty murky territory, but generalises to displays quite 
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unlike those that originally shaped it (at the end of Chapter 2), The analytic 
scheme has now gone about as far as possible given the level of sophistication 
of the psychological theorizing engaged in up until now. Thus, in the next 
chapter we vriil consid* further developments in the context of developing a 
detailed theory of gru, , reading per se. 
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CHAPTER 6; A THEORY OF GRAPH COMPREHENSION 



So far we have concentrated on analyzing existing displays* But it is far 
better to draw a good graph to begin with than to correct one after the Hact* 
In order to know how to generate an effective graph, however, we musi .ave a 
theory of how a reader will actually process the display while reading it* 
This theory can then be used to guide the graph maker to construct an unambigu- 
ous, effective display* The simplified treatment of visual information proces- 
sing presented in Chapter 2 is too sketchy to serve these ends* Thus, in this 
chapter we will consider in detail how people come to understand charts and 
graphs* 

I* Introduction 

Unlike seeing in depth, uttering a sentence, or reaching for a target, 
comprehending a graph is not something that anyone could argue is accomplished 
W a special-purpose mental faculty* Graphs are a recent invention in the 
history of our species, and if they are an especially effective method of com- 
munication, it must be because they exploit general cognitive a*id perceptual 
meVhanisms in an optimal way* A theory that hopes to explain the process of 
graph comprehension will have to identify the psychological mechanisms used in 

interpreting a graph, and a theory that hopes to lead the way to more compre- 

\ 

hensi^ble " graphs and more efficient graph readers will haw to specify which 
operating principles of each mechanism contribute to the overall ease or diffi- 
culty of a graph* Thus, a theory of graph comprehension will draw heavily on 
general^ cognitive and perceptual theory* and where our knowledge of cognitive 

and perceptual mechanisms is sketchy, we can expect corresponding gaps in our 

i 

abilicy to explain the understanding of graphs* Hie worth of a theory will 
probably 'lie not so much in its current successes in accounting for data and 
guiding the graph maker as in its promise of offering deeper and deeper explun- 
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ations of graph comprehension as it absorbs the future discoveries of cognitive 
science * 

As was revealed in ar survey, there is a bewildering variety of graphs in 
current ose, ranging from the line and bar graphs common in scientific jour- 
nals, to drawings in popular magazines i rfhich the thicknesses of two boxer's 
arms might represent the missile strength of the US and USSR, or in which the 
lengths of the rays of light emanating from a yellow disc might represent the 
price of gold in different months. Nonetheless, all graphs can be given a 
common characterization* Each graph tries to communicate to the reader a set 
of pairings of values on two or more mathematical scales* using objects whose 
visual dimensions (i*e* # length/ position* lightness, shape, etc*) correspond 
to the respective mathematical scales, arJ whose values on each dimension 
(i*e*/ an object's particular length/ position/ and so on) correlate with the 
values on the corresponding scales* The pairing is accomplished by virtue of 
the fact that any seen object can be described simultaneously by its values 
along a number of visual dimensions* For example, Figure 6*1 represents a pair- 
ing of values on a nominal scale (countries) with values on a ratio scale (GtiP) 

/ 

using objects (bars) whose horizontal position (a visual dimension) corresponds 
to a value on the first scale/ and whose height (another visual dimension) 
corresponds to a value on the second scale* 

INSERT FIGURES 6*1 AND 6*2 HERE 
Figure 6*2 represents a pairing of values on an ordinal scale (months) 
with values on an interval scale (temperature) using objects (wedges) whose 
radial position represents the month, and whose darkness represents the temper- 
ature* This characterization/ which can be applied to every graph we have 
seen/ was first pointed out by Bertin (1967) in his seminal treatment of 
charts, graphs/ and maps* 
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ks Bertin points out, this characterization implies that a graph reader 
must do three things: a) identify, via alphanumeric labels, the conceptual or 
real-world referents that the graph is conveying information about (Bertin 
calls *"his "external identification"), b) identify the relevant dimens' * of 
variai- jn in the graph's pictorial content, and determine which visual aimen- 
sions corresponds to which conceptual variable or scale (Bertin's "internal 
identification"), and c) use the particular levels of each visual dimension to 
draw conclusions about the particular levels of each conceptual scale (Bertin's 
"perception of correspondence")* 

Even a characterization as simple as this one raises a host of psycho- 
logical questions, and until these questions are answered, we will not be able 
to predict what will make a particular graph easy or difficult to comprehend* 
These questions subdivide into two classes* First, note that a graph reader 
must mentally represent the objects in the graph in only a certain way* in the 
case of Figure 6*1, he or she must think of the bars in terms of their posi- 
tions on the page, the jagged contour' formed by the tops of the bars* their 
left-to-right order, and so on* This raises questions about how a visual 
stimulus is encoded internally/ or, in the terms of the theory we will outline 
here, how the reader's visual description of the graph is built up* Second, 
the graph reader must remember or deduce which aspects of the visual consti- 
tuents of the graph stand for which of the mathematical scales that the graph 
is trying to communicate* This raises questions about how knowledge in memory 
interfaces with visual input, or, in the terms of the present theory, how the 
reader's g raph schema will spell out the ways in which the physical dimensions; 
of the graph may be mapped onto the appropriate mathematical scales* in using 
the "visual description" and the "graph schema" to interpret a graph, a reader 
may obtain different sorts of information about it* Bertin points out that a 
reader can extract the exact value of some seal** paired with a givon value on 
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another scale, the rate of change of values on one scale within a range of 
values on another, a difference between the scale values of tuo entities, and 
so on* We will use the term conceptual question to refer to the particular 
sort of information that i reader wishes * ^ extract from a graph, and 
conceptual message to re + ,r to the information that the reader, in fact, takes 
away from it (cf* Bertin, 1967)* 

In the rest of the chapter, we go beyond Be r tin's work by defining and 
characterising each of the mental representations involved in giaph comprehen- 
sion, proposing ways in uhich they are constructed and transformed in the 
course of reading a graph, and attempting to outline principles that dictate 
which aspects of these processes and representations affect the ease of 
extracting a message from a graph* These principles will provide the theore- 
tical basis for the operating principles discussed in the preceding chapters, 
replacing the simplified theory of visual information processing presented in 
Chapter We will try to justify these proposals by appealing to existing 
knowledge of perceptual and cognitive . functioning, and by showing concrete 
instances of graphs and other visual displays whose degree of intuitive diffi- 
culty is explained by the proposals* Of course, the ultimate empirical test of 
the theory will be its ability to explain the relative ease with which various 
sorts of people extract various sorts of information from various sorts of 
graphs, over as wide a range of people, messages, and graphs as possible* 

II* The Visual Array 
The information in a graph arrives at the nervous system as a two-dimen- 
sional pattern of intensities on the retinas* We will use the te^m visual 

i 

array to refer loosely to those early visual representations that depict the 
input in a relatively unprocessed, pictorial format (cf. the "2-1/2 dimensional 
sketch" of Marr s Nishihara, 1977, and the "surface array" of Kosslyn, Pinker, 
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Smith, £ Shwarts, 1979)* Information in this form is, of course, far too raw 
to serve as a basis for comprehending the meaning of the graph* For that, we 
need a representational format that can interface easily with the memory repre- 
sentations embodying knowledge of what the visual marks of the graph signify* 
Such memory representations cannot be , jted in terms of specific distribution^ 
wif light and dark as would be represented in the visual array, because vastly 
different intensity distributions (differing in size, orientation, color, 
shape, lightness, etc*) could all be equivalent exemplars of a given type of 
gr-ph* Thus, the representation that makes contact with stored knowledge of 
graphs must be more abstract than a visual array* 

XXI* The Visual Description 
A fundamental insight into visual cognition is that the output of the 
mechanisms of visual perception is a symbolic representation or "structural 
description" of the scene, specifying the identity of its parts and the rela- 
tions among then (see Winston, 1975; Marr £ Nishihara, 1977; palmer, 1975; 
Pylyshyn, 1973)* This mental description is not in English, of course, but in 
sone symbolic "language of thought" which represents visual information in a 
manner appropriate to its use by other cognitive processes such as language, 
reasoning, motor control, and so on* In this description, the various aspects 
of scene, such as its constituent elements, and their size, shape, loca- 
tion, color, texture, etc*, together with the spatial relations among them, 
will be factored apart into separate symbols* As a result, each higher-level 
cognitive process need only refer to the symbols representing the aspect of the 
scene that is relevent to its own computations* For example, processes govern- 
ing limb movement will access symbols explicitly representing an object's posi- 
tion in the three-dimensional world, whereas processes that formulate the 
sequence of words that will be uttered in response to the question "What color 
is that shirt?" will access symbols explicitly representing an object's hue* 
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This allows us to describe the mind economically as & set of more-or-less auto- 
nomous nodules (see Simon, 1969); there is a visual system which need "know" 
nothing about either English syntax or skeletal musculature and, a linguistic 
system, which need "know" nothing about the 1" «j of perspective, and a motor 
control system which need "know" nothing abouv the laws o.^ jolor mixture--all 
the systems can communicate via a common symbolic description of a scene. We 
will use the term visual description to refer to the structural description 
representing a graph* and visual encoding processes to refer to the mechanisms 
that create a visual description from a visual array pattern* 1 

Many "languages" for visual descriptions have been proposed in the litera- 
ture on vision in psychology and artificial intelligence (e,g*, Palmer, 1975; 
Marr £ Nishihara, i 977/ Hinton, 1979; Winston, 1975/ Miller £ Johnson-Laird, 
1976)* Most of them describe a scene using propositions, whose variables stand 
for perceived entities or objects, and in which predicates specify attributes 
of and relations among the entities* It is assumed that the visual encoding 
mechanisms can detect the presence of. each of these predicates in the visual 
array* Tor example, one-place predicates specify a simple property of an 
object, such as Circle (x) (i*e* "x is a circle 11 ) , Convex (x), Curve (x)t Flat 
(x), Horizontal (x), Linear (x), Small (x)* and so on* Two-place predicates 
specify the relations between two objects, such as Above (x,y) (i*e* "x is 
abov* y"), Adjacent (x,y), Below (x,y), Higher (x,y), Included-in (x,y), 
Points -towards (x,y ) , Parallel (x,y ) t Part (x, y), Wear (x, y ), similar (x* y ) , 
Top (x,y), and so on* Three and higher-place predicates indicate relations 
among groups of objects, such as Between (x,y,z) (i.e*, 11 x is between y and 

*Note that our use of structural descriptions to represent the information in a 
graph does not bear on the devate over whether mental images involve 
information in an array or a structural description (e*g«, Kosslyn, Pinker* 
Smi^h and Schwartz, 1979)* That debate is not over whether arrays and 
structural descriptions exist in general, but whether the array can bs* fcilloJ 
with information from long term memory as well as from the eyes* 
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z"), In-line (x#y,z), and so on* Parameterized predicates take a number ot 
variables and a number of quantitative constants/ such as Area (x# a) (i*e*, H x 
has area Width (x,ot), Location (x,a,ti), Lightness Orientation 

(x,ct), and so on* These predicates may also be appropriate for specifying 
continuous multidimensional attributes of objects* which otherwise would be 
difficult to specify by a predicate chosen from a finite list* For example, 
any member of a class of shapes ranging from a flattened horizontal ellipse 
through a circle to a flattened vertical ellipse can be specified by two param- 
eters, representing the lengths of the major and minor axes of the ellipse/ 
thus; Ellipse (x,a, 3)* 

As is fitting for a paper on graphs, we will use a graphic notation for 
visual descriptions* Each variable in a description will be represented by a 
small circle or node in which the variable name is inscribed (for simplicity's 
sake, we will usually omit the variable name in the diagrams to be used in the 
chapter); each one-place predicate will simply be printed next to the nodes 
representing the variables that they Are true of; and each two-place predicate 
will be printed alongside an arrow linking the two nodes representing the pre- 
dicate's two arguments* Thus* a particular scene represented as the visual 
array in Figure 6* 3a will be represented as the visual description in Figure 
6*3b, or its graphic counterpart in 6*3c* 

INSERT FIGURE 6*3 HERE 
Constraining the Visual Description 

If, as we argued, a visual array representation is unsuitable for tho com- 
putations involved in extracting information from a graph, an unconstrained 
visual description is not much better* Since any visual array can b*s described 
in an infinite number of ways, a theory that allowed any visual description to 
be built from a visual array would b* 1 unable to predict what would happen whon 
a given individual faced a given graph* For example* the array in Figure 6*3^ 
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can give rise not only to the visual description in Figure 6.3c, but to the 
descriptions in Figure 6*4 as well* 

INSERT FIGURE 6*4 HERE 
Clearly, if it is not to be utterly vacuous, f v * theory must specify which 
visual description is likely to be constructed in a ^iven situation, based on 
our knowledge of how the human visual system works* In the following section, 
we summarize four broad principles, each grounded in basic psychological 
research/ which constrain the form of visual descriptions* These principles 
will bear a large explanatory burden in the theory to be outlined here, since 
later we will claim that a prime determinant of the difficulty of a graph will 
be whether the visual description specifies explicitly the visual dimensions 
and gtoupings that the graph maker recruited to symbolize the mathematical 
scales i-nvolved in the message of the graph* 
A* The Indispens i bility of Space 

It has long been known that an object's spatial location h?s a different 
perceptual status than its color, lightness/ texture, or shape* Bertin (1967) 
tries to formulate this generalization by distinguishing between the two 
spatial dimensions of the surface of the paper (his "dimensions du plan", 
loosely* "framework dimensions") and other dimensions such as lightness and 
color (which he calls "retinal dimensions")* Michael Kubovy (1981) has 
addressed this issue systematically/ and calls the two spatial dimensions of 
vision (plus the time dimension) lndlspensible attributes , analogous to the 
dimensions of pitch and time in audition* He defines the term "indispensible 
attribute" as an attribute with the following properties: 

1) Perceptual Humerousity , The first constraint on a visual description 
mast be on what is to count as a variable or node. Variables should stand for 
psrceptua 1 units of some sort, and not for any arbitrary subset of the light 
reflected from a scene (e*g», the set of all light patches whose dominant wavo- 
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length is divisible by 100)* 2 Kubovy points out that our perceptua-1— systems 
pick out a "unit" or an "object" in a visual scene as any set of light patches 
that share the same spatial position/ but not as a set of light patches that 
share some other attribute such as wavelength, intensity, or texture* Thus* 
Figure 6.5a will give rise to the visual description in Figure 6.5b/ which 
partitions the array into three variables according to spatial location rather 
than that in Figure 6.5c/ which partitions the array into two variables accord- 
ing to surface markings. 

INSERT FIGURE 6.5 HERE 
2) Configural Properties . The second constraint on a visual scene is the 
choice of predicates available in assembling a visual description. Naturally, 
there will be predicates corresponding to all perceptible physical dimensions 
(e.g./ bright (x), red (x), shiny (x)/ lightness (x,a), length (x/a>; in addi- 
tion, there will be "conf igural" or "pattern 11 predicates corresponding to 
higher-order functions defined over the physical dimensions. Kubovy points out 
that most configural properties in a sensory modality are defined over the 
indispensible attributes, which in the case of static objects vision are the 
vertical and horizontal spatial dimensions. As a consequence/ there exist many 
predicates for spatial shapes (each of which can be defined by certain well- 
developed chances in relative horizontal and relative vertical positions in a 
pattern), but few for nonspatial "shapes" defined by analogous veil-defined 
changes in other dimensions. For example, the array in Figure 6.6a contains 
elements whose heights increase with their horizontal position (lightness vary- 



2 This question, incidentally, is begged by Bertin's proposal that the diffi- 
culty of a graph may be predicted by hew many "perceptual glances" a reader 
must make in reading a graph. Until we know what forms a perceptual unit thdt 
a "glance" centers upon, we will not know how many glances must be mad«3 > 
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ing randomly),; the array in Elgure 6.6b jcontains elements whose lightnesses 
increase with their orientations (position varying randomly)* 

INSERT FIGURE 6*6 HERE 
However, the increase is immediately noticeable only in Figure 6*6a, where the 
increase is of one atial dimension with respet, to another, not in 6* 6b* 
Correspondingly, there exists a predicate diagonal (x) that can be used to 
describe the scene in 6*6a, but nothing analogous for describing the scene in 
6*6b, whose elements wou^d probably be specified individually* Note that as 
long as one member of a pair of related dimensions is spatial, there may be 
configural predicates available; when neither member is spatial, . configural 
predicates are unlikely* Thus, the elements in Figure 6*7 get darker with 
height, a change that, unlike that in 6*6c, is quickly noticeable, and may be 
captured by a single predicate (e*g*, darkens (x)J* 

(insert Figure 6*7 Here) - 
3) Discriminability and Linearity * As we review in Chapter 3 of this 
volume, physical variables are not in general preceived linearly, nor are small 
differences between values of a "physical variable always noticed* In the vis- 
ual description, this corresponds to numerical variables (e*g*, height (x,17)J 
being distorted with respect to the real world entities they represent, or to 
distinct numerical variables sharing the same value when the represented enti- 
ties in fact differ (e*g*, lightness (x,17); lightness (y,17J for two boxes 
differing slightly in lightness)* Kubovy remarks that indispensible attributes 
afford finer discriminations and more linear mappings than dispensible attri- 
butes, and indeed, our summaries in Chapter 3 show that the Weber fraction for 
spatial extent is 0*04, and the Stevens exponent is 1*0, both indicating 
greater accuracy for the representation of spatial extent than for the repre- 
sentation of other physical variables* 



ERLC 



224 



4) Sele ctive Attention* As a consequence of (1), each variable may have 
associated with it a unique pair of coordinates representing its location* 
This means that location could serve as an index or accessing system for visual 
infornation* This is a form of selective attention/ and Kubovy summarises 
evidence supporting the hypothesis u ^ attention is more selective for the 
indispensible attributes (horizontal and vertical location) than for other 
visual attributes (e*g*/ one cannot easily attend to all visible objects with 
the same lightness or shape/ regardless of location/ see Posner and Snyder, 
1980, for example)* In the theory outlined in this chapter, selective atten- 
tion according to location will consist of a mechanism that activates various 
encoding mechanisms to process a given spatial region of the visual array/ in 
order to encode nore predicates into the visual description or to verify 
whether a given predicate is true of the entity at that location* 3 As we shall 
see, these mechanisms will play an important role in the "question-driven" or 
"top-down" processing of graphs* 
B. Gestalt Laws of Grouping 

The principles associated with the indispensibility of space in vision 
place constraints on the portions of an array that variables may stand for/ on 
how numerical variables represent physical continua, and on how predicates are 
encoded or verified with respect to the visual array* What is needed in addi- 
tion is a set of principles governing how variables representing visual enti- 
ties will be related to one another in visual descriptions/ that is/ how the 
atomic perceptual units will be integrated into a coherent percept* A notable 



J This proposal is similar to P*rtin's connective that a focal percept (his 
"imagp", the content of a "perceptual glance") may consist of a spatial loca- 
tion plus the value of one "retinal dimension" at that location* I£ is not 
clear, however, why one should suppose that only one nonspatial dimension can 
be encoded at a given location* Indeed* in the discussion of coordinate sy^~ 
tens for nonspatial dimensions, we discuss evidence that in fact several 
physical dimensions may be encoded simultaneously by the human visual systom* 
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set of such principles is the Gestalt Laws of Perceptual Organization (see 
Wertheimer, 1934; Chapter 3, this volume). These laws dictate that distinct 
static perceptual elements will be seen as belonging to a single configuration 
if they are near one another ("proximity"), simile in terms 4 of one or more 
visual dimensions ("similarity"), smooth continua, _ons of c another ("good 
continuation' 1 ) or parallel ("common fate") in the 2D plane* In terms af the 
visual description, these principles will determine how variables are linked 
via the "part" relation in structures like those in Figures 6.8a (where the law 
of similarity links asterisks to asterisks and circles to circles), 6.8b (where 
common fate links the asterisks to the line/ and similarity links the asterisks 
to one another), and 6.3c (where good continuation keeps the straight and 
curved lines distinct, proximity links the asterisks and crosses to their 
respective lines, and similarity links asterisks to asterisks and crosses to 
crosses). Figure 6.8d shows how 6.8c would be represented in a visual descrip- 
tion. 

INSERT FIGURE 6.8 HERE 
There *s another way of indicating the effects of grouping within visual 
descriptions. That is to link each member of a group to every other member 
using either the relation that gave rise to the grouping, or siiaply the rela- 
tion "associated with 11 . Thus* the visual array in Figure 6,3a, above/ could 
also be represented as in Figure 6.9; 

INSERT FIGURE 6.9 HERE 
This notation can be used to indicate that the variables are grouped together 
perceptually, but not so strongly as to be a distinct perceptual unit. In the 
rest of this chapter, we will use both notations for grouping, though no theo^ 
retical distinction will be implied by the choice. 
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C* Representation of Magnitud e 

Implicit in our earlier discussion of the psychophysics of visual dimen- 
sions was the assumption that these dimensions are represented by continuous 
interval scales in visual descriptions* Though the fine discriminations and 
smooth magnitude estimation functions found in psychophysical experiments ^ 
strongly warrant this assumption, we have reason to believe that .quantity can 
be mentally represented in other ways as well* First, there is evidence from 
experiments on the absolute identification of values on perceptual continua 
that people cannot remember verbal labels for more than about seven distinct 
levels of a perceptual continuum (Miller, 1956), and that in making rapid com- 
parisons between remembered objects, subjects' reaction times are insensitive 
to the precise values of objects belonging to distinct, well-learned categories 
(Kosslyn, Murphy, Bemesderfer, and Feinstein, 1977), Findings like these sug- 
gests that quantity can also be represented (indeed, in pernor y must be repre- 
sented, in certain circumstances) by one of a set of seven or so discrete sym- 
bols each specifying a portion of the range of quantities* These symbols could 
be signified by the Roman numerals I through VII* 

Second, it is useful to distinguish between ratio values, where quantity 
is represented continuously but the units are arbitrary, and absolute values , 
where the units are well-defined* The perception of pitch is a notorious 
example where a precise mental representation of a dimension is possible, but 
where for a majority of people, no absolute units can be assigned to the stim- 
uli* Length, on the other hand, Is an example of a continuum which people can 
judge either in ratio terms (e*g*, one object being 1*7 times as long as 
another), or in terms of the well-known lnches-fe^t-yards scale (e*g*, Gibson 
and Purdy* 1956)* Indeed, whether subjects in magnitude estimation experiments 
are asked to use a well-learned versus their own rtrbi trari ly-^eleoted modulus 
for estimated magnitude apparently affects the resulting judganents 
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(Stevens, 1957)* Thus, internal descriptions must discriminate between thes<i 

two forms of magnitude, which we will refer to as "interval --value" and 

"absolute-value 11 , though ordinarily, visual descriptions will only contain 

"interval-value" propositions* 

Finally, as every commercial sign-maker i attest, values on a continuum 

that are extreme in comparison to values of that continuum for other objects in 

■ 

a scene are very likely to be perceptually encoded (as opposed to less extreme 
values, which are apt to be encoded only if attended to)* To account for this 
salience principle, relatively extreme values will be represented redundantly 
in visual descriptions t in ordinary propositions such as "height (x, a)", as 
before, and also by special one-place predicates indicating the extremeness of 
the value along the particular dimension, such as "tall (x)", "bright (x) 1 ', 
"short (x)", etc* When capacity limitations of visual descriptions are dis- 
cussed later in the paper, it will be assumed that these special predicates 
have a very high probability of being encoded in the visual description* 
D* Coordinate Systems 

To express a uni dimensional quality like lightness, one need specify in 
advance only the origin and the units of the scale to be used* However, for 
objects that vary along a number of continua, like the position of an object on 
a two-dimensional piece of paper, or rectangles in a set varying in height a nd 
width, one has to specify how the variation will be partitioned into dimensions 
and how each dimension will be represented* This is the isue of which 
coordinate system is appropriate to represent an object in a set varying along 
several dimensions* In the case of dimensions that refer to spatial location, 
Bertin invents the term "construction schema" to refer to the way that the 
spatial dimensions of a graph are partitioned* This involves questions about 
whether a polar or a rectangular coordinate system is used, whether there is a 
single or multiple origins, and so on* In the case of nonspatial dimensions 
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like color or shape, Be r tin does not use the vocabulary of coordinate systems, 
but it is equally appropriate* We will briefly discuss some of the considera- 
tions relevant to the choice ^f coordinate systems for multidimensional stimu- 
lus, separately for nonspatial and spatial (more exactly, nonpositional and, 
positional) dimensions* 

1) Nonspatial dimensions * Many visual objects can logically be parameter- 
ized in more than one way* For example, rectangles can be classified by their 
heights and widths, or by their sizes and shaped - (where "shape" could be a 
dimension ranging from "very tall and narrow" through "square" to "very short 
and wide")* Similarly, colors can be represented by their hues, saturations, 
and brightness (e,g*, blood is saturated with a dominant wavelength of 700 nm, 
and boiled shrimp are desaturated with a dominant wavelength of 700 nm), or by 
their closeness to various "focal colors" (Rosch, 1975; here blood might be 
highly crimson and not very pink, whereas boilei shrimp would be highly pink 
and not very crimson)* 

One might expect there to be perceptual consequences of which set of 
dimensions a stimulus is encoded along, and indeed there are* Garner (1974, 
see also Chapter 3) distinguishes betwe^r separable and integral perceptual 
dimensions* According bo Garner, each of a pair of "separable" dimensions may 
be attended to independently of the other, whereas one cannot attend to one 
member of a pair of integral dimensions without attending to the other as well 
(see Chapter 3 for a discussion of the experimental procedures used to 
ascertain whether a given pair of dimensions is separable or integral)* One 
way of translating Garner's terminology into our own is to consider separable 
dimensions to be those physically defined dimensions that are also psycho - 
logical dimensions* That is, if color and size are found to be separable 
dimensions in attention tasks, we may infer that humans in fact encode objects 
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into their color and into their size, recording both aimension values as 

separate partis of their mental representations of the objects * Selective 

) 

attention is possible because the dimensions are separately represented 
internallv; one can be processed while the other ~emains in storage* Thus, the 
separabi -y of two physical dimensions is prima acie evidence that those 
dimensions are the ones used in the mental representation* 

Integral dimensions, on the other hand, may very well be psuedo-dimen- 
sions, psychologically speaking; the reason that humans apparently cannot 
ignore the height of a rectangle while attenidng to its width is that height 
and width are not the dimensions that humans, left to their own devices* would 
encode into their mental representations of a rectangel, Rather* the 
psychologically-relevant dimensions might be size and shape , in tarns of, say, 
a fatness - skinniness dimension* When asked to attend to the height of a 
rectangle, there would be no parameter or symbol in the mental encoding of the 
rectangle that represents height alone and thus that ^an be processed while 
other parameters are left alone* Rather, both the size and the fatness- 
skinniness parameter implicitly contain information about height, and both 
would have to be processed so that their values may be transformed into the 
height value that performance on task demands* This transformation process 
could account for the increase in tine required to sort stimuli along integral 
versus separable dimensions* Similarly, the reason that humans apparently 
cannot ignore the saturaiton of a color while attending to its hue may be that 
the color is not naturally encoded into separate hue £nd saturation parameters, 
but into parameters representing its proximity to various focal colors such as 
pink, red, brown, and so on* 

In sum, we may determine exactly which dimensions humans use in their 
mental representations of multidimensional stimuli by examining the results of 
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Garner-type experiments* If a pair of physically-specified dimensions is 
separable, we may conclude that there is a mental parameter corresponding to 
each of those dimensions* On the other hand, if a pair of dimensions turns out 
to b** integral, we may conclude that the mental parameters representing those 
stimuli correspond to a Jerent dimtnsionalization of the stimuli from the 
one the experimenter had in mind* Intermediate cases (e*g*, where no possible 
dimensionali zation of a stimulus set yields perfect separability) may reflect 
multiple parameterized encodings of a stimulus* the various encodings differing 
in strength of activation (see the section on Processing Constraints below)* 

2) Spatial Dimensions * In their influential paper on shape recognition, 
Marr and Hishihara (1977) proposed that memory representations of shape are 
specif ,ed with respect to object -centered cylindrical coordinate systems* 
Furthermore, the coordinate systems are distributed: instead of there being a 
global coordinate system with a single origin and set of axes, there is a 
cylindrical coordinate system centered on the principle axis of the object 
(e*g*> in the case of an animal, its torso)* in which is specified the origins 
and axeo of secondary coordinate systems centered on the various parts of the 
object attached to the principle axis (e*g*, the animal's head and liubs)* 
These secondary coordinate systems are* in turn, used to specify the origins 
and axes of smaller coordinate systems centered on the constituent or attached 
parts of the secondary part (e*g*, the thigh, shin, and foot of the leg), and 
so on* We will adopt here the following aspects of Marr and Nishihara's 
theory: 1) shapes and positions are mentally represented principally in polar 
or rectangular coordinates (the former is just a slice of a cylindrical coordi- 
nate system orthogonal to its axis; the latter is just a slice of a cylindrical 
coordinate system including its axis)* 2) The locations of the different 
elements of a scene are represented in separate* local coordinate systems 
centered upon, other parts of the scone* not in a single, global coordinate 
systen* This means that in the visual description, the specification of 
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locations (and also of directions and of parameterized shapes) of objects will 
be broken down into two propositions, one specifying the object upon which the 
coordinate system will be centered, the other specifying the extent or value of 
the object within the coordinate system, is in F ; -tre 6*10* 

INSERT FIGUK. 6*10 Hh.<E 

In fact, it is generally more perspicuous to indicate the extent along 
each dimension, and the location of the axis of the coordinc.te system corres- 
ponding to tht* dimension, separately, as in Figure 6*11 

INSERT FIGURE 6*11 HERE 

The important question of which objects may serve as the coordinate system 
for which other objects has received little attention in the vision literature, 
but the following condition seems to be a plausible first approximation: the 
location (or direction, or shape parameters) of object a will be mentally 
specified in a coordinate system contered on object b when* 1 ) b_ is larger 
than ^a, and 2) _a and b are perceptually grouped according to one or more of the 
Gestalt laws* 

Processing Constraints on Visual Descriptions 

Since, with deliberate effort, people can probably encode an unlimited 
number of properties (e*g*, theangle formed by imaginary lines connecting a 
standing person's right thumbnail, navel, and right kneecap), visual descrip- 
tions can in principle be arbitrarily large* In practice, however, two factors 
will limit the size of visual descriptions; 

1) Processing Capacity * Most models of cognitive processing have re- 
strictions on the capacity to maintain the activation of ncdes in a short-term 
visual description (Anderson & Bower, 1973; Newell & Simon, 1973)* Specifi- 
cally, it is claimed that between 4 and 9 nodes may be kept active at one time 
(see Chapter 2), fewer if processing resources are being devoted to some con- 
current task. This limitation reflects the well-known finiten^ss on human 
immediate memory and processing capacity* 
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2) Default Encoding Liklihood and Automaticity . As mentioned, any predi- 
cate in a person's visual repertoire can be added to a visual description in 
response to higher-level processes testinq for the presence of a particular 
predicate applied to a particular variable (e.g., "is a square?")* However, 
before these top-down processes come into play, a number of predic -es will be 
assembled into a visual description, because they are "just noticed". Differ- 
ent predicates have different probabilities of being encoded under these 
"default" circumstances. Presumably, some predicates innately have a high 
default encoding likelihood [e.g., enormous <x) , dazzling (x) ] whereas the 
default encoding likelihood of others is 'determined by familiarity and learned 
importance, shiffrin and Schneider C 1 971 ) and Schneider and Shiffrin (1977) 
propose that when a person frequently assigns a visual pattern into a single 
category, he or she will come to make that classification "automatically", that 
is, without the conscious application of attentional or processing capacity. 
Translated into our vocabulary, this means that frequently-encoded predicates 
will have a high default encoding likelihood . A number of experiments applying 
Shiffrin and Schneider's proposals to the learning of visual patterns confirm 
that the recognition of patterns becomes ra^id, error-free, and relatively 
insensitive to other attentional demands as tne patterns become increasingly 
well-oractised. 

Therefore, it is important to distinguish among several sizes of visual 
descriptions. A description that is assembled automatically by purely data- 
driven (as opposed to top-down or conceptually-driven) encoding processes will 
be called the "default visual description". Its composition will be determined 
by the relative "default encoding likelihoods" of the various predicates satis- 
fied by the visual array. In contrast, a description that is shaped by concep- 
t ial processes testing frr the presence of visual predicates at particular 
locations in the *rray will be called an "elaborated visual description". 
Visual descriptions can also be classified in . terms of whether short-term mem- 
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ory limitations are assumed to be in effect* A small visual description such 
as can be activate! at a given instant will be called the "reduced visual 
description"; a visual description that includes all the predicates whose 
default encoding likelihoods are above a certain minimum, * all the predi- 

cates that are successfully tested for by top-down processes, will be called 
the "conplete visual description"* The complete visual description will 
correspond to the description encoded by a hypothetical graph reader with un- 
limited short-tem memory, or to the description integrating the successive 
reduced descriptions encoded by a normal graph reader over a long viewing 
period* One way to think quantitatively of the size of the default visual 
description that a person will encode is to suppose that the probability of a 
given true predicate's entering into a visual description is a function of its 
default encoding likelihood multiplied by a constant between zero and one 
corresponding to the amount of capacity available (i*e** not devoted to other 
concurrent tasks)* when the constant is one, the resulting description will be 
a "complete" visual description? as the constant decreases with decreasing 
available processing capacity, the size of the description will be reduced 
accordingly* We adopt the final assumption that the level of activation of a 
node begins to decrease steadily as soon as it is activated, but that the read- 
er can repeatedly re-encode the description by reattending to the graph (see 
the voluminous literature on decay and rehearsal in short-term memory summar- 
ized, for example, in Crowder, 1975)* since encoding is probabilistic, the 
description will differ in composition somewhat from one encoding to the next* 

V* An Example 

Now that we have some constraints on the size and composition of visual 
descriptions, we can examine how a particular graph might be described mental- 
ly* This will be the first step in working through an example of how a graph 
is understood according to the current theory* The? example, shown in Figure 
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6*12, is a bar graph plotting the price per ounce of a precious metal we will 
call "graphium" over a six month period* 

ItfSERT FIGURE 6*12 HERE 
A ^complete" default visual description is shown in Figure 6*13, (Dotted lines 
represent propositions, om: 1 A .for the sake of clarity, that may be deduced 
from nearby propositons for similar parts*) 

INSERT FIGURE 6.13 HERE 
Most aspects of this visual description are "motivated by the constraints 
outlined in the previous secton* The scene is parsed into subscenes, each 
occupying a distinct location in the visual array (though for readability's 
sake, the locations for the subscene nodes will not always be printed in the 
future)* This parse is done according bo the Gestalt principles, yielding 
separate nodes for the "L"-shaped framework and for the group of bars* Sy 
those sane principles, the framework is connected by the "part 11 predicate to 
nodes representing its vertical and horizontal segments, and each of these is 
linked by "near" predicates to nodes representing the conceptual meaning of 
that text* Of course, the meaning of expressions lijce "price of graphium" is, 
in all likelihood, mentally represented by an assembly of nodes linked in com- 
plex ^ays to the nodes representing the visual appearance of the text, but 
since the process of reading, text is not our concern here, this simplified 
notation will suffice (the predicate associated with these "meaning" nodes will 
be replaced within quotation marks to indicate that they are not in fant uni- 
tary predicates)* Predicates for the "bar" shape are attached to each bar 
node; the "tall" predicate is attached to the salient tallest bar; a pair of 
particularly discrepant bars is connected by the predicate "talier-than" *, and 
th* set of four progressively shorter bars is grouped together under its own 
rode with its own shape predicate "descending-staircase*" Finally, the height 
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and horizontal position of each bar is specified with respect to a coordinate 
system centered on the appropriate framework segment, due to the framework's 
being larger than the bars and associated with them by proximity and common 
fate* 

VI* Conceptual Messages, Conceptual Questions 
We now have an example of the immediate input tc the graph comprehension 
process* Before specifying that process* it would be helpful to know what its 
output is as well* One can get a good idea of ^hat that output must be simply 
by looking at a graph and observing what one remembers from it in the first few 
moments of seeing it or after it has Just been removed from view* In the case 
of the graph in Figure 6*1 2, one might notice things like the following: a) 
the price of graphium was very high in March; b) the price was higher in March 
than in the preceeding moath; c) the price steadily declined from March to 
June; , d) the price was $20/ounce In January/ e) the price in June was x: 
(where x is a mental quantity about half of that for January, about a fifth of 
that for May, etc*)* Basically, we have a set of paired observations here, 
where the first member can be a particular value of the indepenJent variable 
(e*g», "March"), a pair of values (e*g*, "March vs* February"), or a range of 
values (e*g», "the last four months")* The secoad number of each pair can be a 
ratio value (e*g», a value x" along some meatal ratio scale), an absolute value 
(e*g*, "$20/ounce"), a difference (e*g*, "larger"), a tread (e*g», "decreas- 
ing"), or a level (e*g*, "high")* Bertin first jointed out these options, 
* 

using the term "elementary questions" for those referring to single values, 
"intermediate questions" for differences, and "superior questions" for trends* 
This information can be expressed in a representation coasisting of a list of 
numbered entries, each specifying a pair (or, for more comp} ax graphs, an n- 
tuple) of variables, the extent or type of each independent variable (e*q*, 
ratio-value, pair, range), and the value (or difference or trend) of the cor- 
responding dependent variable* Thus, the conceptual message representing the 
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information which we are assuming has been extracted from the graph in Figure 
6*12 will look like this (the intuitive meaning of each entry can be made 
clearer by assuming the entry is a sentence beginning with the word when): 



absolute-value = March, V 2 level = high 
pair = March s February, difference = t er 

■| range * March - June, V 2 trend = decreasing 
V[ absolute-value - January/ absolute-value = ?20/o2* 

V 1 absolate-value = June, V 2 ratio-value = x* 



In general/ conceptual messages will be of the following form: 
i: ratio-*value =* tx, ratio-value. » ft, * ** 

or or 

absolute-value absolute- value 
or or 
pair pair 
or or 
range range 

i designates the ith of an arbitrary number of entries (in principle), V 
— — — a 

designates the ath of an arbitrary number of variables/ and a designates a 

specific value in'a form appropriate to the entry (e.g./ a "higher 1 * or "lower" 

primitive symbol if the entry specifies a difference between values of the 

second variable corresponding to a pair of values of the first) 4 ** Note that 

the variables are differentiated by subscripts instead of being named by their 

real-world referents (e.g., "month"); this was done in recognition of people's 

ability to extract a great deal of quantitative and qualitative information 

(indeed, virtually the same information) when a graph has no labels at all/ 

leaving the referents of the variables unknown* When the referents are known, 

the conceptual message can indicate this with entries like the following; 

6; = months, - price-of-graphium* 



^Xt is possible to have several equations in an entry refer to the same varia- 
ble, eg*: 

17: Vi absolute-value = 14, V| ratio- value c 132, V[ level - high, 
level = low* 
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Presumably, when the reader has integrated all the information he or she wishes 

to extract from the graph/ he or she can make che message representation more 

economical by replacing each V, by its associated referent symbols 

— 1 

From nere, it is a s* matter to devise a notation for conceptual ques- 
tions* (Recall that a coti^e^tual question is a piece of information that the 
reader desires to extract from a graph*) One can simply replace the _a or jj in 
the generalized entry presented above by the "?" symbol/ indicating that that 
is the unknown but desired information* Thus, if a person wishes to learn the 
price of graphiun during the month of April/ we posit that he or she has acti- 
vated the representation 

7; absolute-value = April/ V 2 absolute value = ?* 
If the reader vishes to learn the trend of graphium prices during the first two 
months, he or she sets up the representation 

8j range = January-February, V£ trend = ? * 
If the reader wishes to learn the month in which graphium prices were low/ he 
or she activates 

9: V| absolute-value = ?, V 2 level = low, 

and so on* 

VII* The Graph Schema 
So far, our theory has implicated an information flow diagram like the one 
in Figure 6*14* - 

INSERT FIGURE 6*14 HERE 
How / we must specify the unknown component labelled with a "?"* From the 
flow ch^rt, we can see what this component must do: 1) it must specify how to 
translate the information found in the visual description into the conceptual 
message, and 2) it must specify how to translate the request found in a con 
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ceptual question into a process that accesses the relevant parts of the visual 
description (culminating as before in one or more entries in the conceptual 
message)* Furthermore, since (1) and (2) will involve different sorts of tran- 
slations for different types of graphs (e.g., for line graphs versus bar 
graphs), the unknown component will also havt \* 3) recognize which type o£ 
graph is currently being viewed. The structure that accomplishes these three 
tasks will be called a graph schema , and it, together with the processes that 
work over it, will be discussed in this section. 
A. schenas 

A schema is a memory representation, embodying knowledge in some domain, 
consisting of a description containing "slots 11 or parameters for as yet unknown 
information. -Thus, a schema can specify both the information that must be true 
of some represented object of a given class, and the sorts of information that 
will vary from one exemplar of the class to another (For detailed presentations 
of various schema theories, see Mi'nsky, 1975? Winston, 1975; Norraan £ Rumei- 
hart, 1975; Bregraan, 1977* Schank £ Ableson, 1977). TO take a simple example 
unrelated to graphs, Figure 6.15 could be a schema for telephone numbers, 
specifying the number and grouping of the digits for any number but not the - 
identity of the digits for any particular number, these being represented by 
the parameters a-j . 5 

INSERT FIGURE 0.15 HERE 
This schema can be instantiated for a givi*n person/ becoming a representa- 
tion of his or her particular telephone nuirber, by replacing the parameters 
labeling the lowermost nodes by actual numerical predicates. In doin^j so, one 
is using the schemi to recognize a candidate character string as a telephone 



v These upper case parameters , which stand for unknown predicates, should not be 
confused with lower case variables * which stand for perceptual entities and 
correspond to nodes in the visual description (although usually, the variable 
itself is omitted and only the node is depicted). 
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number, by matching the schema against a visual description of the candidate 
string* The visual description of an as yet unrecognised number will be iden- 
tical to the schema, except that it lacks the conceptual nodes like "area code" 
and "exchange" and that it contains constants in r" ^e of parameters* Once the 

schena is instantiated by the visual description, one can use it to retrieve 

V 

desired infornation about the telephone number using a node-by-node net search- 
ing procedure (i*e*, one can quickly find "the first digit of the exchange" 
without searching the entire string, by starting at the top node and following 
the appropriate arrows down until the bottom node labeled by the desired number 
is reached)* The double labeling of nodes is what allows schemas to be used 
both for recognition an3 for searching; a visual description of a to-be- 
recognised pattern will contain labels like "digit", but not "area code", so 
the "digit" labels in the schema are necessary for recognising the object* 
However, the search procedures will be accessing conceptual labels like "area 
code", so these are necessary, too* 
B* Graph schemas; A Fragment 

It seems, then, that a schema of this sort for graphs might fulfill two of 
our three requirements for graph knowledge structures; recognising specific 
types of graphs^ and directing the search for desired pieces of information in 
a graph* what we now need is some device to translate visual information into 
the quantitative information of the type found in the conceptual message* 
These devices, which we will call message flags , consist of conceptual message 
equations, usually containing a schema parameter, which are appended to predi- 
cates (nodes or arrows) in the graph schema* When such a node or arrow is 
instantiated by a particular visual description for a graph, the parameters in 
tha message flag are replaced by the corresponding value in the instantiated 
schena, fi*\<l tho equation is added to the conceptual message* Figure 6*16(a) 
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illustrates equation flags for a fragment of a bar giraph schema (the flags are 
enclosed in rectangles, and are attached to the nodes they flag by dotted 
lines)* 

INSERT FIGURE 6*16 HERE 
<en a reader encounters the graph represented by the fragment of a visual 
description in Figure 6*16b (the numbers representing values along a mental 
ratio scale with arbitrary units), he or she can instantiate the schema (i-<**, 
replace the parameters h and B by the values 4 and 37), and add an entry to the 
conceptual message* All^quations sharing a given i_ prefix are merged into a 
single entry, and each i_ is replaced by a unique integer whej^ the entry is 
added to the, conceptual message. Thus, the following entry is created; 

1; ratio-vali:e = 4, V 2 ratio-value » 37 
This informal sketch should give the reader a general idea of how the graph 
schema is used in conjunction with the visual description to produce a concep- 
tual message* In the sections following, we present a comprehensive bar graph 
schema, and define more explicitly the processes that use it* 
C* A Bar Graph Schema 

Figure 6*17 presents a substantial chunk of a schema for interpreting bar 
graphs* it is, intentionally, quite similar to the visual description for a 
bar graph in Figure 6*13* The graph is divided into its L-shaped framework and 
its specifier material, in this case, the bars. The frameworJc is divided into 
the abscissa and the ordinate, and each of these is subdivided into the actual 
line and the text printed alongside it* In addition, the "pips" cross-hatching 
the ordinate, together with the numbers associated with them, are listed expli- 
citly. The height and horizontal position of each bar are specified with 
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respect to coordinate systems centered on the respective axes of the framework/ 
and each bar is linked to a node representing its nearby text* An asterisk 
followed by a letter inside a node indicates that the node/ together with its 
connections to other nodes, can be duplicated any number of +* J mes in the visual 
description* The letter itself indicates that each duplication of the node is 
to be assigned a distinct number/ which will appear within tt)e message flags 
attached to that instance of the node* 

INSERT FIGURE 6*17 HERE 
Tne message flags specify the conceptual information that is to be "read 
off" the instantiated graph schema* They specify that each bar will contribute 
an entry to the conceptual message* Each entry will equate the ratio value of 
the first variable (referred to in the description as "XV", for Independent 
Variable) with the horizontal position of the bar with respect to the abscissa/ 
and will equate the second variable (the "DV", or Dependent Variable) with the 
bar's height wi*th respect to the ordinate* In addition/ the absolute value of 

the independent variable for an entry .will be equated with the meaning of what- 

X 

ever label is printed belc it along the abscissa* Finally/ the referents of 
each variable will be equated with the meaning of the text printed alongside 
its respective axis* 

In devising these formalisms* we were at one point distressed that there 
was no straightforward way to derive absolute values for the dependent vari- 
able* The ratio value of each bar, corresponding to its height, could easily 
be specified/ but since the absolute values are specified in equal increments 
along the ordinate* far from most of the bars, and specific to none of them, no 
simple substitution process yill do* However/ a simple glance at a bar graph 
should convince the reader/ as it convinced us/ that this is not a liability 
but an asset* The absolute valuo of the dependent variable at a given level of 
the independent variable is indeed not immediately available from a bar graph* 
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Instead, one seems to assess the height of a bar in terms of some arbitrary 
perceptual or cognitive scale, and then search for the pip along the ordinate 
whose vertical position is closest to that height* The number printed next to 
that pip, o" ^ number interpolated between the numbers printed next to the two 
nearest pip^ is deduced to be its absolute value* in contrast, the absolute 
value of a given level of the independent variable (i*e., which month it is), 
or the relative values of the dependent variable (e*g., its maximum and minimum 
values, its trends, or differences between adjacent values) seem available with 
far less nental effort* The most natural mechanism fcr representing absolute 
values of the dependent variable within the bar graph schema, and the one that 
happens to be in accord with the actual difficulty of perceiving these values, 
is to add to the conceptual message special entries asserting an equivalence 
between a certain level of the referent's absolute value and a certain level of 
the referent's ratio value, each entry derived from a labeled pip on the ordin- 
ate* The leftmost message flag in Figure 6*17 sets up these entries; the 
symbol "=" indicates that the two equations are equivalent* Presumably, 
higher-level inferential processes, unspecified here, can use these equivalence 
entries to convert ratio values to absolute values within other entries in the 
conceptual message, calculating interpolated values when necessary* 6 

Earlier, we mentioned that the visual system can encode predicates that 
stand for well-defined groups of objects, and also that conceptual messages can 
contain entries specifying a trend of one variaole over the range of another* 
An implication of r the theory, then, is that graph readers Cor at least experi- 



b The schema presented here perhaps unfairly anticipates that the bar graph 
example will have individual labels for each bar along the abscissa; and a ■ 
graduated scale along the ordinate* In fact, graduated scales often api>oar 
along the abscissas of bar graphs as veil* In 'a more realistic bar graph 
schema, the subschema for the pips of a graduated scale would be appended to 
the abscissa as well as to the ordinate* 




ERJC 243 



enced graph readars) should be able to translate direc' Ky a higher-order per- 
ceptual pattern, such as a group of bars comprising a staircase, into the 
quantitative trend that it symbolises, without having to compute the trend by 
successively examining each element* Furthermore, the difference i r v ^ight 
b^tveen a pair of adjacent baj. might be encodable into a siiyjle predicate, 
which should be directly translatable into an entry expressing a difference in 
the symbolised values* Also, a salient perceptual entity might be encoded as 
extreme (independently of the encoding of its precise extent on a ratio scale), 
-and this should be directly translatable into an entr^ expressing the extreme- 
ness of i 1 ^ corresponding variable value, again without the mediation of ratio 
scale values* These direct translations, which, as we shall see, play an im- 
portant role in predicting the difficulty of a graph or the effectiveness of a 
graph reader* are accomplished by the message flags in Figure 6*18 (which 
should actually be part of Figure 6*17, but is depicted separately for th* sake 
of clarity)* Figure 6*13 shows that bars in a graph can be described not only 
in terns of their heights and horizontal positions, but also in terms of being 
extremely tall or short, in terms of differences between the heights of adja- 
cent pairs, or in terms of groups that constitute a perceptual whole* In each 

, INSERT FIGURE 6*1 8 HERE 

case the appropriate 'equation is attached the predicate which encodes the 
attribute* Two additional notational conventions are introduced in the figure: 
the location of a pattern that occupies an extended region of the array is 
specified by its endpoints along a ratio scale (i.e*, "H-r 1 ) , both in the visu- 
al description and in the conceptual message* m addition,- one~of the equation 
flags for a pair of bars makes reference to nodes standing for the bars them- 
selves* Pj and s{y rather t^an for an attribute like horizontal position* it 

is assumed thai when a pair of bars is encoded as a p^ir, some information 
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about each bar is encoded as well* This information, be it ratio value, 
absolute value, or level, can then be iinked with or substituted for 
appropriate symbols for the bars t£j or ) within the entry for the pair- 

VIII* Processe s 

In the account so far, we have relied upon the intelligence and coopera- 
tiveness of the reader to deduce how the various structures are manipulated and 
read during graph comprehension* In order to use the theory to make predic- 
tions, it will be necessary to define explicitly the procedures that access the 
structures representing graphic information* Four procedures will be defined: 
a MATCii process that recognizes individual graphs as belonging to a particular 
type, a message assembly process that creates a conceptual message out of the 
instantiated graph schema, an interrogation process that retrieves or encodes 
new information on the basis of conceptual questions, and a set of inferential 
processes that apply mathematical and logical inference rules to the entries of 
the conceptual message* 
A* The MATCH Proces s 

The term is borrowed from Anuerson and Bower's (1973) theory of long-term 
memory* This process compares a visual description in parallel with every 
memory schema for a visual scene, computes a .goodness-of-f it measure for each 
schema (perhaps the ratio or difference between the number of matching nodos 
and predicates and the ^ ^ber of mismatchirg nodes and predicates), and selects 
the schema with the highestjgoodness-of-fit measure* This schema, or rather, 
the subset of the schema that the limited capacity processes can keep activa- 
ted, is then instantiated (i*e. the parameters in the schema are replaced by 
the appropriate constants found in the visual description*)* This is the pro- 
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cedure alluded to in vague terms before, that uses the graph schema to recog- 
nise a graph as being of a certain type (e.g., bar graph, pie graph)* 7 
B. Message Assembly 

This process accomplishes the translation from visual info*" ^tion to con- 
ceptual information, also alluded to in previous sections* It starches over 
the instantiated graph schema, and when it encounters a message flag, it adds 
the message it contains to the conceptual message, combining into a single 
entry all equations sharing a given prefix (i*e*, all those beginning with the 
same " i : " ) * it is assumed tnat at the time that the MATCH process instantiated 
the parameters of the graph schema, the parameters within the message flags 
were instantiated as well* 

Memory and processing limitations imply that not every message flag in the 
graph schema is converted into an entry in the conceptual message; some may 
not be instantiated because the visual description was reduced, or because the 
default encoding likelihood of the predicate was low; some may not be instan- 
tiated because of noise in the MATCH process; and some may be skipped over or 
lost because of noise in the message assembly process* For these reasons, we 



/ This process has been oversimplified in several ways, in accordance with cer- 
tain oversimplifications in the graph schema itself , For one thing, concep- 
tual labels like "abscissa" do not appear in visual descriptions, and so 
should not count in the goodness of fit calculations* This could be 
accomplished by distinguishing the conceptual or graph-specific predicates 
from the rest, perhaps by listing theta, too, as message flags, which are "read 
off" the schema, but not used to instantiate it- The second complication is 
that different nodes and predicates should count differently in the recogni- 
tion process* Some might be mandatory, some might be mandatorily absent, 
some mi'jht be characteristic to various degrees, some might occur in sets from 
which one member must occur, and so on* There are several ways of 
accomplishing this, such as the introduction of logical operators into 
schemas, or the use of a Dayesian recognition procedure, but limited space 
prevents us from outlining them here (see Anderson, 1970; Anderson £ Bowor, 
1973; Minr>ky, 1975; Winston, 1975; Smith, Shoben, £ Rips, 1973)* 
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need a process that adds information to the conceptual message in response to 
higher-Level demands*- 
C. Interrogation 

This process is call* 3 into play when the reader needs some piece of in- 
formation that is not cun ,ntly in the conceptual message (e.g., the difference 
between two values of the dependent variable corresponding to a given pair of 
independent variable values)* As mentioned, each Such request can be expressed 
as a conceptual message entry with a replacing one of the equation values* 
The interrogation process works as follows; the message flag within the graph 
schema that matches rhe conceptual question (i.e., is identical to it except 
for a constant or parameter in the place of the "?") is activated. If it 
already contains a constant (i.e., if the equation it contains is instantiated, 
and thus, complete), the equation is simply added to the conceptual message. 
If it contains a parameter (i.e., is incomplete), the part of the visual des- 
cription *:hat corresponds to that branch of the schema is checked to see if it 
contains the desired constant (e.g,, if a certain ratio-value of the dependent 
variable is desired, the visual description is checked for the presence of a 
constant attached to the node representing the bar's height). If this constant 
is absent from th*? visual description, the encoding process for the relevant 
predicate (e.g., the process that encodes height) is commanded to retrieve the 
desired information f c r the relevant part in the visual array. It can do so by 
using the retinal coordinates attached to the node for the part which are 
assumed to be present in the; visual description (though they have been omitted 
from the diagrams in this chapter). Often, however, th^e coordinates will 
hava decayed, and the coordinates of an assoc* ate3 part together with the 
d^gr»e and direction of rhe association will b* used to direct tho encoding 
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process to the correct location in the visual array* in other words, the con- 
ceptual question can initiate a top-down search for the desired part or part 
parameter in the array* Once the desired information is encoded into the 
visual description, it can be instantiated in the schema and its - ^ge flags, 
and the instantiated equation within the flag can be added to the conceptual 
message* 

D* Inferential Processes 

Hunan intelligence consists of more than the ability to read graphs* in 
the category inferential processes , .e include the ability to perform arith- 
metic operations on the quantitative information listed in the conceptual 
message (e*g*, calculating t! 1 rate of increase, of a variable by subtracting 
^ one value from another value and dividing by a third value), to infer from the 

context of the graph (e*g», the paragraph in vhich it is embedded) what 
information should be extracted from the graph, to draw qualitative conclusions? 
relevant to some domain of knowledge based on the information in the graph, and 
so on* Naturally, we have little to say about, these abilities; they are part 
of the study of cognition in general and not the study of graph comprehension* 
However, we mention them here because roany types of information can be obtained 
either directly from a conceptual message or indirectly from inferential pro- 
cesses operating on the conceptual message* which method is used, we shall 
see, affects the difficulcy of a graph and the efficiency of a graph reader* 

The flow of information specified by the current thoory is summarized in 
Figure 6*19, where blocks represent information structures, and arrows 
represent processes that transfer information amoncj them* 

INJECT FIGURE 6.19 HERE 
IX* where do Graph SchcmaS Come From? 

Tho 'jr.iph ^chc*na dir>juss^<J so far emboJL'2s knowledge of bar graphs (in 
fact, a subset of bar graphs). Clearly, the theory must alr>o account for 
p^ople 1 ^ ability to read othor common types of graphs (llns graphs* pia graphs, 
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pictograms, etc*) and to understand completely novel forms of graphs as well 
(e.g., one in which the length of a ray of light emitted from a disc represents 
the price Q f gold in a given month)* We propose people create schemas for 
specific tvpes of graphs using <=> g eneral graph schema , embodying their know- 
ledge Q f at graphs are for ana now they are interpreted in general* A plaus- 
ible general graph schema is shown in Figure 6*20* There are three key pieces 
of information contained in the schema* First, some objects, or parts of 
objects (specifier material) are described m terms ot several visual 
attributes* Each visual attribute symbolizes a conceptual variable, and the 
set of values of the n_ visual attributes encoded for an object or object part 
corresponds to a particular n-tuple of associated values of th£ respective 
conceptual variables for a given conceptual entity* Second, the ratio mag- 
nitudes of attributed are usually to be specified in terms of a coordinate 
system centered upon a part of the graph framework* Third, textual material 
perceptually grouped with an object specifies the absolute value of the object; 
textual material perceptually grouped, with the framework Specifies the real- 
world referent of the attribute that the coordinate system centered on the 
framework helps to specify; textual material associated with specific local 
regions G f the framework specifies pairings of absolute and ratio values of the 
attribute specified by the associated coordinate system* In other words, the 
general graph schena encodes knowledge of graphs in a way that 

INSERT FIGURE 6*20 HERE 
respects the basic assumptions underlying our analytic schemo (Chapter 2), in 
which graphs are parsed into specifier material, a framework, and a set of 
labels* lio^r? that for maximum generality, text is linked to perceptual enti- 
ties by the Plicate "associated"/ which can symbolize proximity, similarity, 
continuity , and so on* Thi.s helps to enconpass graphs with parts directly 
labelled, o^l graphs exploiting common colors or shapes in Hoys anj legends* 
Similarly, th*> predicate "attribute' 1 is meant to encompass length, width, 
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orientation, lightness* color, etc* However, the indispensibility of visual 
space motivates "geometric shapes" as opposed to arbitrary visual predicates 
being specified as typical frameworks* and spatially localizable "parts" being 
specified as the units over which attributes are defined* 

In encountering a cer ±n type of graph for the first time/ a reader will 
generate a specific graph schema for *t using the general graph schema* The 
reader will have to replace the predicates "specifier material", "associated", 
"attribute 1 ', "geometric figure"/ and so on/ by the actual visual predicates 
found in the visual description of the novel graph* This will be possible when 
the visual description has a structure similar to that of the general graph 
schema, with objects described ir* terr.> of attributes defined with respect to a 
framework, and textual labels associated with each* In addition/ an astute 
graph reader will add to the new specific graph schema higher-order predicates 
(e*g*/ "descending-staircase 11 ) that can be taken to symbolize global trends 
(e*g*z a decrease in the dependent variable)* However, the availability of 
these higher order predicates, and how transparently they symbolize their 
trends, will differ arbitrarily from graph type to graph type, and so these 
predicates cannot be included in any simple way within the general graph schema 
but must be created case-by-case* This process will be discussed in more 
detail in the section describing what makes a graph reader efficient* 

Pushing the question back a step/ we may ask/ "Where docs the general 
graph schena come from?" This question is more profound, and the'ans^r to it 
is correspondingly murkier* In one sense, one could answer that people are 
explicitly taught how to read certain types of graphs* But/ this still leads 
one to wonder how people can generalize from the small set of graph types that 
th^y ^ro exposed to in school (basically, bar graphs, line graphs, pie fjra^hi, 
ani pictoqr*iffls) to the myriad exotic forms that are created and easily under- 
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^ stood in, say, TIME magazine* This is especially problematic given that formal 

instruction in graph Reading does not teach the abstract concepts such as 
i 

"attribute"* "extent", "ratio value", and so forth, that in fact define what 
^ all graphs have in common* A deeper answer to this question, then, would seem 

to In a basic human ability to associate a b ^xe of values (i*e*, an attri- 
bute with an "extent" predicate) .in one domain with a scale of values in virtu- 

# ally any other domain, so long as the "positive" end of one scale, as mentally 
represented, coincides with the "positive" end of" the other* Thus, there are 
lawful relations governing such diverse phenomena as the order of words with 

♦ different sounds in conjoined phrases, the choice of which member of a pair of 
associated symbols or metaphors will represent specific ideas, and which way of 
installing a switch or gauge will yield the most efficient man-machine inter- 

© action* See Cooper and Ross (1975), and Pinker and Birdsong Cl 979) for dis- 

cussions of some of these principles and their significance* 
X* The Difficulty of Comprehending a Graph 

• In this section/ we consider what makes different types of graphs easy or 
difficult w hen particular types of information have to be extracted (by "type 
of information", we ar« referring to different conceptual questions, such as 

# ones referring to ratio values vs* differences vs* trends*)* 

Aside from the limitations of the peripheral encoding mechanisms (i*e*, 
limits on detectability, discriminability, and the accuracy of encoding magni- 

• tudes), the structures and processes described here permit any quantitative 
information whatsoever to be extractible in principle from a graph* This is 
because no information is necessarily lost from the visual description "up- 

0 ward", and because there are no constraints on what the inferential processes 

can do with the information in the conceptual message* 

In pr^cti^o, though, limits on short-term memory and on processing resour- 

# cgs vi 11 make different sorts of information easier or more difficult to ex- 
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tract* We havo assumed that the visual description that is encoded is/ in 
fact, is a small subset of the complete visual description, and that noise in 
the MATCH and message assembly processes causes only a subset of that reduced 
visual description to be translated into conceptual message information. The 
remaining conceptual message entries will contain the infor. .on that .> 
"easily extracted" from a graph, since a simple lookup procedure suffices to 
retrieve £he information* On the other hand, if the desired information is not 
already in the conceptual message/ it will have to be generated either by the 
top-down interrogation process, which adds entries to the conceptual message, 
or by the inferential processes/ which perform computations on existing en- 
tries- Each of these processes can involve a chain of (presumably) capacity- 
limited computations, and each process properly includes the lookup of informal 
tion front the conceptual message* Therefore, they are necessarily more time- 
consuming and memory -consuming (since the results of intermediate computations 
must be temporarily stored) than the lookup of existing information in the con- 
ceptual message* And, in a limited-capacity/ noisy system like the human mind* 
greater time and memory requirements imply increased chances of errors or 
breakdowns, hence, increased difficulty* We can call this conclusion the Graph 
Difficulty Principle s A particular type of information will be harder to ex- 
tract from a given grsph to the extent that inferential processes and top-down 
encoding processes, as opposed to conceptual message lookup, must be used* 

This kind of principle is to be distinguished from the "operating ~tnci- 
ples" introduced earlier; those principles specified properties of the display 
itself which must be respected if a graph is to communicate effectively* This 
kind of principle explains why those particular stimulus properties have the 
effects thsy do; namely because of the structure and limitations of tho human 
visual inforndtLon processing system* In particular, the effects of violations 
of the earlier principles of discciminabllity/ distortion, gestait organiza- 



252 



tion, integral/separable dimensionality and limited capacity (unit binding), 
all can be understood by reference to unfortunate effects on visual descrip- 
tions, and the effects of the remaining operating principles (with the exclu- 
sion of the two forra.il ; nciples, which are not grounded in the properties of 
information processing per se) can be understood by reference to problems in 
Hatching description and stored schemata anj/or deficiencies in the schema 
themselves. 

There will, in turn, be two factors influencing whether a desired type of 
infornation (i,e w the answer to a gi ven ^conceptual question) will be present 
in a conceptual message* First, a message entry will be assembled only if 
there are nessage flags specific to that entry appended to the graph schema* 
That, in turn, will depend on whether the visual system encodes a single visual 
predicate that corresponds to that quantitative information* For example, we 
have assumed that a bar graph schema appends message flags to predicates for 
* sight, horizontal position, extremeness in height, extreme differences in 
height between adjacent objects, and extended increases or decreases in height* 
This respectively makes ratio values of the dependent and independent 
variables, extremeness in value, extreme differences in values, and global 
trends easily etftractible* On the other hand, there is no visual predicate for 
an object being a given number of ordinate scale units high, or for one bar's 
height to be a precise ratio of the height of another, cr the leftmost and 
rightmost bar3 to be of the same height, and so on; therefore, there can be no 
message flags for and no conceptual message entries for the absolute value of 
the dependent variable, the t xact ratio of dependent variable values 
corresponding to successive values of the independent variable, or the equality 
of dependont variable values corresponding to the most extreme independent 
variable values* If a reader wishes the gra[;h to answer these conceptual 
questions, he or she can expect aore difficulty than for the conceptual ^ 
questions discussed previously* 
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The second factor influencing whether a conceptual message entry will be 
assembled ^s the encoding likelihoods of the predicates attached to the corres- 
ponding equation flags in the 9raph schema* In the example we have been using, 
if the predicate "descending-staircase" has a very low default encoding likeli- 
hood, and hence is absent from the visual description on most occasions, the 
entry specifying a decreasing trend will not find its way into the conceptual 
message until interrogated explicitly* Incidentally, apart from innateness and 
automaticity factors, the encoding likelihood of a predicate may also 
be influenced by "priming"; when a graph schema is activated (i*e*, when the 
graph is recognises as being of a particular type)/ the encoding likelihoods of 
the visual predicates are temporarily enhanced or "primed" (see Morton, 1969)* 
In other words, when a graph is recognized on the basis of partial recognition, 
the schena makes the rest of the information more likely to be encoded for as 
long as the schema is activated* 

As simple as the Graph Difficulty Principle is, it helps to explain a wide 
variety of phenomena concerning the appropriateness of different types of 
graphs for conveying different types of information* Consider Cartesian line 
graphs, for example* The English language has a variety of words to describe 
the shapes of lines: straight, curved, wiggly, V-shaped, bent, steep* flat, 
jagged, scalloped, convex, smooth, and many more* It also has words to des- 
cribe pairs of lines; parallel/ intersecting, converging, diverging, inter^ 
twined, touching, X-shaped, and so on* It is safe to "assume that the diverse 
vocabulary reflects an equally or more diverse mental vocabulary of visual 
predicates for lines, especially since the indisp^nsibility of visual space 
inplies that predicates for configural spatial properties like shape should be 
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readily available* The availability of these predicates affords the possibili- 
ty of a line graph Schema with a rich set of message flags for trends. For 
example, if 'V 1 and "y" are nodes representing lines on a graph, with Vj the 
abscissa, the ordinate, and V 3 the parameter, the propositions on the left 
side of Table 6*1 can be flagged with the-conce^ .al message equations on the 
right side of the table*; 

Table 6*1 ' 
Predica te Equation Fla g 

Flat tx) V 2 trend = unchanging 

Steep (x) V'i trend = increasing - rapidly 

Inverted U-shape (x) V 2 -trend = quadratic ? 

U-shape (x) V2 trend = quadratic 

Jagged (x) V 2 trend = random 

Undulating (x) V£ trend = fluctuating 

Straight {x) V 2 trend = linear 

S-shape (x) trend = cubic 

Rectilinaer (x) trend = abruptly changing 

Not flat ( x ) V ; affects V 2 

Parallel (x,y) Vj, v 3 additively affects 

Converging (x,y) . V if v 3 interactively affects V 2 

This makes line graphs especially suited to representing particular trends of 

one variable ov^r a range of a second, the covariation versus independence of 

two variables, and the additive versus interactive effects of two variables on 

a third* In contrast, the mental vocabulary for the shapes implicit in the 

tops of a set of grouped bars is poor, perhaps confined to "ascending-stair- 

case", "descending-staircase", and -"rectangular", a s implied in Figure 6*18* 

Correspondingly, there will "e fewer possibilities for specifying trends in a 

schema for bar graphs, and less likelihood of assembling specific "trend" and 

"affects" entries in thti conceptual message when a bar graph is processed* And 

tho predicates for a pair of shapes implicit in the respective tops of two 

integrated groups of bars will be even scarcer, preventing "additively affects." 

and "interactively affects" entries front, being encoded. Small wondtii:, then, 

that lino .jraphs are the preferred nethod of displaying multidimensional 

scientific data, where cause-and-ef f ect relations, quantitative trends, and 
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interactions among variables are at stake* TO convince yourself of the appro* 
f 

priateness of line graphs for these purposes, try to determine the nature of 
the trend of V^ ovor the range of Vj, and the nature of the interaction of V^ 
and {a variable with two levels, A and B) on V 2 , from Table 6*2/ Figure 
6.21a and Figure 6.21b. 

Table 6.2 ' 



V 
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INSERT FIGURE 21 HERE 
It should be easy to see from the line graph in Figure 6*21b that at level A of 
Variable 3, Variable 2 is increasing and positively accelerating, whereas at B, 
it is increasing linearly* Similarly, one can see that Variables 1 and 3 
interact in their effects on Variable '2* "This, is because the "straight" and 
rt concave-up" predicates, correspondiotf to "linear* and "positively accelerat- 
ing" trends, are readily encodable* in contrast,- the like-eolored bars in 
Figure 6*2ia do not form a group where relative heights can be described by a 
single predicate, and so inferring the trend necessitates a top-iown bar-by-bar 
height comparison, a difficult chor^ because it is hard to keep the heights of 
all the bars in mind (i*e*, activated in the visual description) at once* It 
is even [Tiore Uifficult to extract the trends from the table, because not only 
is a numbar-by-nurcber comparison necessatfV/ but the process of encoding a 
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multi-digit numeral's magnitude seems to be intuitively slower and more effort- 
ful than the encoding of a bar's height* 8 , 9 

However, bry to answer the following question by examining the table, bar- 
^raph, and^ line graph just considered: what is tl«* exact value Variable 2 

- level B of variable 3 and level 4 of Variable 1? Most people ^ind the ques- 
tion easiest to answer with reference to Table 6*1, a bit harder with reference 
to the bar graph, and hardest of all with reference to the line graph* This 
illustrates the principle of purpose-specificity, developed earlier and fre- 
quently noted in the graph comprehension literature, which i an inescapable 
consequence of the present theory? different types of graphs are not easier or ■ 
more difficult across-the-bcard, but are easier or more difficult depending on 
the particular class of information that is to be extracted* In this case, we 
have alrea"/ noted that absolute values of the dependent variable in a bar 
graph cannot be directly entered into the. conceptual message, since there are 
no visual predicates that correspond to them* Rather, specific ratio values of 
the dependent variable can be encoded," as can pairings between arbitrary abso- 
lute values and ratio values (from the numbers printed along' the ordinate),- 
the absolute value of a particular entry must be computed by effortful inferen- 
tial processes using these two kinds of information* The line graph is harder 



^Bertin might motivate a similar prediction by saying that orientation is a 
retinal variable, and thus, according to his theory, may be apprehended in a 
single glance in a line graph, as opposed to the multiple glances necessary to 
detect the several heights indicating a trend in a bar graph (recall that his 
difficulty metric is the number of glances necessary to extract a piece of 
information)* However, as we have seen, many predicates other than orientation 
may be used to convey trends in a line graph (e*g* r "undulating"), and these 
are not to be found in Bertin's list of the 6 retinal variables* 

^Incidentally, though a line graph is better than other forms of data presen- 
tation for illustrating trends, typically, only one way of constructing the 
line graph will illustrate a given trend optimally* For example, a line graph 
that used Variable 3 (i.e*, A vs* D) as the abscissa, and Variable 1 as the 
parameter, would ot illustrate the linear and accelerating trends as transpar- 
ently as the graph in Figure 21(b), since these trends no longer correspond to 
single attributes of a distinct perceptual entity, but must be inferred from 
the successive intervals separating the left endpoints of the five lines, and 
those separating the right end points of £hose lines, respectively* 
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^ISS^^i": V? r a^ W "lVOW this .principle- favors et^h_er line graphs, .bar graphs, -x»xT^ te 
^ii^^^^sp^ n ^ r ^- orijthe. type, of . question Ithe Reader is. to answer. Of "cpjSg$-V" 
^J^^^^Tnq.^ tj}&~ only" choices" that face a L graph designer. In this sec.t^nTJC 



-^Cjbg^ff ly. - show, how other dosigo. .choices, can be resolved hy_ :the,,.G£aptt .Di^.i^.. L \,^ 



) One gra'ph" with two iihes f ~ or" two graphs with one line? As m.entlonedjj 






variable foe "the conceptual message entry specified by that element (eg. in the 
-bar graph schema we examined previously), Xf the correspondence is specified 
instead in an insert or legend (i.e., with, a label next to a small patch 
sharing ^e color, shading, or internal structure of ~ the lines ^- ars), thac 
correspo. ^ence irmst be extracted bf the effortful inferential" pwcesses, using 
one entry specifying the distinguishing feature of the bar or line in the 
graph, and a second entry linking that distinguishing feature to the 
appropriate absolute value, based on the legend or insert* Therefore, labeled 
lines should be better (again, ;assuning the number o£ elements is not so large 
that spurious groupings arise), 

3 ) grid lines or not? r Whether a graph should include horizontal or ver- 
. JLJ^f^lf i?f 5 ^ ali 3q gd "^i^h absolute .values, on the. aras... and .^running,, across. #^ T; ..j v -. A ...,^ 

--t^^^^^f^. th ^ b -^ Tin ?^: ,,cpmm&"nibateS by/ -direct labeling, for -example, .~t^^^^= : ^/-^,^^ 
'■i^^^^^^^'.-^^^- 1 ^ * ^kr g»PjO * ..If absolute values,, are . important., - thS^^ff ^ 4. -, ■ X2 
"S^^^^S ^ .-^ol^^ng.""«ayr : the. to^of a. bar -Cor ,a. we(il-:, defined -se^ent;pf ; ^^5^ " . "^"^ 

horizontal grid liie, apd the .grS^^"S^-, : -. - ..-S 
= ^ 9 _ wl ^ an absolute value label .on the ordinate, . caus^g^E^I, . .". . - 
i^S^S x ^ n ^ the visual d^;eri£tiqn- ^ -7^ 

^! ^g^ fvg c ^ na ^ t ^ c ?. ^. messa 9e flag to this node. : cpjrfiguration, : specif yin^ 7^- „ - 

absolute value of the ^pendent variable, and con^ibu^g^ : :- - • , -"^| 
.^ onc& P.^l message entry when the graph is read, without the grid - : . ■ 
'^^^^^r-^ inferential, processes would have to deduce the absoli*t<^- -- - - - - 

. ^^^^■_s;by examining two distinct .entries, one. of which (the height of the qrdin> ------- 

: :labe.l_) would, probajbly .have to be. extracted via- top-down processes - --. -. - - 
ir -"- : -^F^ er ? a *y' if absolute values ^are not part of the intended message of the 
;--^::g^a^i, it edible that the lines will torn spurious groupings with graph 

- slants, or nay overload the capacity of the activated visual .description, ana 
^--^^n^that case they are b^st avoided*: 



XI, The Efficiency of a Graph Reader 
Until now, we have been referring to a single idealized M graph reader 11 . 
Naturally, flesh-and-blood graph readers will differ from one another in signi- 
ficant ways* For example, some peor_ <nay have swifter elementary information 
processes, or a A . rger short-term memury capacity, or more powerful inferential 
processes* Though these factors may spell extreme differences in how easily 
different people comprehend graphs, they are not specific to graph comprehen- 
sion, and we will not discuss them further* Instead, we will focus on possible 
differences anong peopl- in their abilities to read graphs per se. 

A natural way : of determining what nakes a person good at reading graphs is 
to examine what makes the graph reading process more or less easy (i.e., the 
considerations, in the. proceeding section.}, and to predict that, individual,,.dif^,,^- 
in the nature of .the struct^ur* s *td . processes involved will spe.ll^_--^VL 
^Ws;rences ;in the genaral. ease- with which individuals read graphs- . - .:Vv^:.^ : 



- ^-■i/ v?^ a1 ^ ^ a ?. #1* last "section, we showed that a given type .of -jjifc™^^ 
to extract from a given -type . of cjraptl if there were message --Slags'J 
T^i^i graph sche ^ * pacific to that, information, and if the. predicates toT^l ^ 
tne fla 3 was. attached were present- in_ the activated visual descriptipn^iiv 
S-^S- S ~*? hm Ea ^ h * actar Al^ws *of individual differences- First a person^ _ X , 
schema may lack important message .flags. Thus, he Or she may not knp^-"---_ 
". -^rallel lines in. a lin& graph, signal the' additivity of the effects of -two 

variables on a third. When pressed to determine"" whether additivity holds in .a 
certain- graph, such a person would have to resort to costly inferential proces- 
ses operating on a set^of. entries for ratio or absolute- values* In general, ., 
the- theory predicts thalr the-presence or absence of message flags in a person's 
scheina will have dramatic effects on how easily that person can extract the 
information specified by the fl^g. Second, the predicates that trigger the 
prefers whereby message flags are assembled into conceptual juessag* §n t^,s may 



be more or less likely to appear in the visual descriptions of different 
people," The needed predicate^/ because of lack of practise at encoding them, 
may not yet be automatic, and hence may have low default encoding likelihoods* 
^irthernore, the links between H-ose predicates and th^ rest of the graph 
schema may be weak, dissipating "ae "priming effect 11 which assists the encoding 
of inissing predicates once a graph has. been recognised* 

Returning now to" the first factor affecting the efficiency of graph 
readers,, we might; ask what will determine whether people have the necessary 
equation flags in their schemas, and whether the encoding likelihoods and links 
among predicates in a schema will be Sufficiently strong* ..As" to the first 
question, there are probably three routes to enriching graph schemas with 
useful flags; 

Being told* it is common .for. formal instruction in mathematics and"."- 
r -^0^..to r s^elX_out.what to look for in. a graph when faced .with a particui^r--^ 

^lf^ n * ?or ^^P 1 */ students learning "statistical procedures, like the %al^ 
!.y^^ ;^f Variance are "usually told that nonflat, lines indicate main effects^ V/V 
,,ffll$^V^llgl sets of lines indicate interactions, ~u-shapea -liiies indicate ^gS-j 

rat^e - -trends, and go on* - - :f = 

b) Induction * An insightful reader or graph, maker might notice that.- 
J^J^^tiy^ trends of a given sort always come out as graphs _ with particular--/ 

yfsu^_attclbutes (e,g, quadratic functions, yield U-shaped lines) * He or she r , 
could then append the message flag expressing the trend to the predicate 
..symbolizing the visual attribute in the graph schema* - 

c) Deduction, Still snore insightful readers could infer that owing ta 
the nature of. the mapping between quantitative scales and visual dimensions in 

a given typ^ of graphs a certain quantitative trend must translate to a certain , 
vi^a^l property, For example, a person coald. realize thit the successive doub- 
lings of variable by an exponential function must lead to a curve that 
becomes increasingly steep fxor* left to right, 



TaKen together, these principles suggest chat improvements in the ability 
to read graphs of a given sort will come a) with explicit instruction concern- 
ing the equivalences holding between quantitative trends and visual attributes 
(so as to enrich the graph schema); b) with instruction as how to "see" the 
graph (i*e*, how to parse it pc - ftually into the right units, Yi^--- 3 thej 
appropriate visual description), and with practice at doing so (making the* 
encoding process automatic and thereby increasing the encoding likelihoods 
and. associative strengths of the relevant visual-predicates); and c) with 
experience at physically plotting different quantitative relationships on graph 
paper (affording opportunities for the induction and deduction of further cor* 
respondences between visual attributes and quantitative trends, to be" added as 
message flags to" the graph schema)* 

: ?J~T %1X» Extension, .of - tile Theory -to Charts and Diagrams 

'r^JJuaiititatlye "information is not the only kind that is transmitted by~^y±s^- 
'^^XBpiaysi and it wculd be surprising^ the charts and diagrams usedto'T"" 
?*P r _ e ^ qualitative information were Comprehended according to principles^ cadi-- 
J^^f .different from those governing graph comprehension* In fact, the theory" 
~^.? c _ r ib&£ in these pages can be extended virtually intact to the domain oi. ... ._ 
"charts and diagrams^ Again, a visual description of the diagram would be. en- 
cp^l,. obeying the. principles of grouping, the indispensibility of space, and.- 
5^ on/ and again, there would be a "chart schema 11 for a particular subspecies,, 
which specified. a) the constituents of the visual description that identify the 
graph as beln-^ of the appropriate sort (e*g,, a flowchart v^* a Venn diagram), 
and b) the correspondences between visual predicates and conceptual message 
entries. The conceptual message entries would be of a form appropriate to" the 
qualitative information represented, and conceptual questions would consist of 
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conceptual message entries with the "?" symbol replacing one of the constants. 
The HATCH, message assembly, interrogation, and inferential processes would 
play the sane rolas as before. Charts would be easier or more difficult 
depending on whether the visual sys* encoded them into units corresponding to 
iaiporfcarit chunks of conceptual information /and chart readers / ^uld be more 
fluent to the extent that their chart scheraas specified useful correspondences 
between conceptual information and visual attributes, and to the extent that 
those visual attributes were encoded reliably. A brief example follows* 

Venn diagrams, used in set theory, consist of interlocking- circles, each 
of which represents a mathematical set.- Presumably, th*y are effective because 
the visual system can easily encode patterns of overlap (which will translate 
-^|?:,-- stit intersection), inclusion, (translating into the- subset-superset rej^^ 
_JS?]U> nonoverlap [translating into disgjointnessl, and so. oil* Simplif iedY-t YY 
^^S?J- Arrav ' v £^ai Description, " Chart" Schema, and Conceptual Message repi&Y.Y 
I^J^tioTis specific to Venn diagrams appear in Figure 6-23a through d. 

Yi^Y^ . . ;:_ \ ihsert eiguse 6 . 23 hebb " 

.- YZY Ev?n ^ c -^ .simplified, examples f . one. .can. see that:, _as ..before,. thaY YY-" 

JJES^lty of retrieving* given type, of in£ormation..will. depend on -what l-stdql.- 
-t^tl^isual. description and graph schema, and not simply what is on " the page.*. Y :- 
T^;"exa^.pl^, here the readar would-have to infer the fact that Set^C.is a subsV 
set of sez B from the conceptual message entry stating that Set B i$ a superset/ 
of \5et C* A mors efficient diagram reader might have a richer schema, . contain?* 
ijig^the predicate "included-in" together with ^"message flag stating that one/ 
set is a: subset of the other- This would spare that, reader -from having to-rily 
.op- inferential processes; "~ " _/Y 

- Other sqr^£ of diagrams and charts use other visual predicates to convey 
xhe-ir messages efficiently* for example, flowcharts use shape predicates to , ' 
signet:/ the type operation (e.g*, action vs. test), they use the contiguity 



of shapes tfith lines t_ indicate the flow of control, and they use the orien- 
tation of arrowheads to indicate the direction of that flow* *ihe linguist's 
tree di^ratus for the phrase structure of sentences use horizontal position to 
signify precedence relations among constituents, proximity to common line seg- 
ments co signify dominance (inclusion) relations, and above/below predicates to 
signify the direction of the do&inance relations* For each type of diagram, 
there would be a specific schema" spelling out the correspondence between visual 
predicates and conceptual messages* 

XIII » Conclusio ns 

- £ 

This chapter began with a warning that our understanding of graph compre- 
hension would advance in proportion to our degree of understanding qf general 
perceptual and cognitive faculties- As we have seen, the theory, outlined: hex-*-, 
\? n ^£^. borrows shamelessly from perceptual and cognitive theory, adopting, 
^^49 btners, the following assumptions? the necessity of prepositional Or " 
;;S;truc"tiiral descriptions; the indispensibiiity of space as "ft relates to visual 
/gfedieates, selective attention,, creation of perceptual units, an^ ^^uracy qf._ 
5?^?^ ln ^ ; lifted capacity of short-term_ visual representations; the.juse._-. 

-^^J^lrrieuted coordinate systems _ for encoding shape and position; the percept.. 
J;ual integrality of certain .physical dimensions; the use of schemas to mediate .. 
between perception and memory; the effects of physical salience on encoding 
likelihood; conceptually-driven or top-down enc-ding of visual attributes; a. 
MATCH process £or recognition; "priming" of visual predicates; and strengthen- 
ing of associativa links with practice* We trust that t-^is enterprise is not - 
totally parasitic, though, since in developing the theory, significant gaps in 
-our knowledge of visual cognition came to light. For e^a^ple;" 

& What are the exacf constraints on the physical attributes that can serve 
■a^ visual predicates, and what _^et*nain&s the likelihood of their being 



q tthac are the relative strengths of the gestait principles, and in what 
format should the groupings they impose be represented in structural 
descriptions? 

o Which constraints determine how message fla' .""an be appended to predi- 
- cares in schenas? flre there limits on the types of predicates, the" ~ 
number of predicates, the number of parameters, and so on, that a mes- 
sage can refer to? 
* How do visual descriptions guide top-down encoding processes? 
o How general can the information in a general schema (like the general 1 
graph schema) ,be? Can" such schemas be taught or enriched? 
"o Whan are , the decay rates for different sorts of information in the visu 
£ : .al description? . ^ _ ~ - 

submit guest ions" like these as particularly important targets f or fu- 
^tificS;_ research in visual cognition, ones whose answers will, in lar^e^part, Jd& 
"pfegqiiisites £oj^our further understanding of graph comprehension, . , - , 

Finally ^even in its. current" early stage of development the theory serves 
^njgul^role.as a guide for constructing ..charts and graphs. In the following 
Chapter we nafce use of it in our "attempt to specify a set of" complete guide-! 
jline^ fsr the creation of unambiguous, easily-understood charts ^nd graphs* .. 



CHAPTER 7: GENERATING CH£i^3 AttD GRAPHS 



In this book, we have focused on how people read and understand charts and 
graphs, We have approached thi;* problem in two ways. We fir^t considered the 
— cha-^-und grap:- 3 a conpiex~set of symbols" that work; together to represent ~" 
specific information, In Chapter 2, we developed in analytic scheme that 
"specified how these symbols work, and allowed us to diagnose the reasons why 
they sometimes fail to work* A- key component of cur scheme was the L set of 
''operating principles/ 1 most of which were rooted in observed facts about the 
operation of the human visual system, limitations, oh memory, and the way~we 
comprehend symbols* These facts were pulled together in the theory presented 
.A^hapter 6, which. reflects the other side-.o.f our approach y fco -the _pxoJbier^^^ v 
^ISfe;^ 3 ' we d ^ d not treaj; the chart or graph as a symbol system in i"te_^n.^-^ 
^-T^fc; but rather "considered what would have to go on in a~ reader's ^ead;:^^^-;^ 
' 'g^|^ r "for tfiat person to understand the information in tfie display. . The th.epiy-^ 
- P^£i4 es an account about Why the operating principles we have posited .ar el as^j-. 
. jjffiT jara* and aboat what underlying factors result in us needing a disp^y^Lth^ , 
_i.c"aiJtaj_n properties* - " V-i //t:^T-, 

...^/;. The theory just presented serves an. important role^when one wants ^to -gxw* -,- 
/jL.cjgr.t or graph* .In this. case, from the .outset one.wants to.avoid vLol^tLrig^.=. z S 

posited? it is awkward, to draw a display first, and^the^ 

anaiyze'lt, and thea repair: the. flaws* Obviously," it- is much better to keep. 

the .ec:entiial pcgbl^ns in mind from the start. and simply avoid succumbing.^ --- -- 
^ them;* One way to this is to try to put oneself in the head of ~the r^ad^V - 

iirst, try srf" specify exactly what information you want the reader to come^ awa£=, 

with when he or she reads the display,^ and then consider how D est to ensure .. 

-cfaat that information gets th&re, Ihinking about things in this general way 
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wiU lead you first" to specify your message, then to select the graph type 
(which is equivalent to selecting a combination of a type of framework and 
specifier) that will be the best. vehicle for that massage. (See also Wright, 
in press, who enphasizes that this "psychological appLoach 11 v aps a designer 
formulate betuer documents &nd computer readouts.) Once hav^uj selected, a type 
of franework and type of specifier, it is relatively straightforward to use 
then to represent particular information effectively. In so doing, one must 
&lso keep in aind that the pragmatic "factors described in Chapter 4 can add or 
shift. 

In this chapter, we offer a^step-by^tep guide to generating effective 
charts and graphs. This procedure is based on the analytic scheme -we developed 
Jj^J*^z*& eariie_r, Tiuis, the procedure .leads one to construct displayjSrAn^---:, 



so within, the maxims. "of the\-yarj^us 
the.pr&vious chapters, air guidelines - w 
^"if^^^i^^^- Precise .th^n ^e Relieve they , can be developed to -the point:- oi^^ '- ~^ 
1^:^^®!^°^^^^^:^ ^ computer program (and are now working on doing ju£fc^ 

;However f this .^program. wouldf have . to interact with human users bec^us^ I 
J^?^ ; sojiie questions that only the user who Knows the- context", in'wio^:".. ^. 
Z^^^^?^ 1 ^ occur _ — can answer* Furthermore, of ten ..the .user" does npt T. 

';Ss^£S^?^P. thsse crucial questions on his ..or her own, but "must be. promp.ted..:to^-- -- ~- 
Z^S^^B^i .tj?-^. explicitly.. ..These .g.ue5tio7a. must, for wie most, part,- be resolyedC- - - 
J^^^ an.£ actually lays pen to paper for pushes keys on a terminal), and -%huW . : -- 

-- I _:^-^° :nsidei: the:Tl be ^° w before- turning to the nuts and bolts of constructing, the, 

:~ : L-:£c:t^LL. display. ... ' - . . . - ". 

fts was evident in. the previous chapter, charts and graphs convey lnf^rzaa? 
: ."t&n ^t different levels of precision, ft rising line, is a kind of graph, 
^Ithqugh sail it conveys is that something is increasing relative to something 
-el^-, in such a display, the implied framework is nothing more than an assigns 
-M^nt-of direction, indicating which way the relevant values are increasing. In 
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this chapter we give instructions about how to generate the most demanding, 
precise kind of display* If the purpose at hand does not require such preci- 
sion, the superflous added information should be deleted* „lt is up to the. 
illustrator, however, to d^ide exactly what information is relevant and what 
is not? if this is decided -rom the outset, the system can be used as des- 
cribed, only now certain parts of our advice will be superfluous* For example, 
if the illustrator decides that the actual values of variables are irrelevant, 
he or she nay simply ignore all advice about labeling axes arid ensuring accur- 
ate reading of specific points* Thus, before beginning we must have a clean 
idea of what we wish to accomplish, ^ 
1 * The initial analysis 

Before one can begin to draw a display, one must first answer five ques-_ 
trons; 1) What information should be in the display? 2) what is the purpose of 
the display? 2) what impression do you want to convey? 4) Who are the intended 
readers? and 5) what materials do you have to work with? Let us consider each 
question in turn and consider the sorjts of factors that will enter into your 
decisions w . 

what information should be in the display? 

Deciding on what you want the reader-to know, after reading a given chart 

or graph i3 critical* Before doing anything else, you must decide what infor- 

*■ ^ 

nation you want to convey* A useful heuristic here is to think of a title for. 
ths dl^pl^y. For example, "Change in productivity over-time ir would lead to a 
different display than "Amount of oil produced*" In the former case we would 
certainly plot values over time, whereas in the latter we might choose to dis- 
pense with tisie and present output from different countries collapsed over 
time, If tisi^. £n fact is irrelevant to the intended message, dispensing with 
Sz jnight sav^ the reader effort and possible confusions* Only after, this ques- 
tion resolved can one know which data are relevant* 
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The second question follows naturally^ from the-first. Given one has 
decided on the data to be presented, what, level of detail^is necessary? Should 
the reader use. the display to extract specific measurements or just to get.an 
overall impression? If too many data are present, they will have to be boiled 
down into a relatively small number of averages (as will be discussed shortly), 
and it is up to the graph maker to decide wili.cn levels of detail must be sacri- 
ficed* Is it necessary to know' da ta about OvOry day of production, for exatt- 
pie, or are monthly data satisfactory? If the reader is to extract arbitrary 
levels of detail, and one has hundreds of numbers, a graphic display may not be 
appropriate at all to this "archival" function* 
What ittprassion do you want to convey? 

&t this point, you should have a set of numbers that could be displayed 
\y&ing several different graph types (as will:be discussed shortly)- Before 
selecting a graph type (i.e., type of framework and specifier), you should 
decide two more things: Co you want to emphasise or de-emphasize a given rela- 
tion? If so, what is it? If you wish to emphasise the rate of growth of one - 
variable over- another, you should keep this in mind when selecting a framework. 
Recoll^thac various pragmatic factors will vary the impression a chart or graph 
conveys- If need be, flip b£ck to Chapter 4 and briefly review these princi- 
ples. Keeping then in nind, we will soon see*that different frameworks are 
more or less pliable for use in exploiting specific principles/- in addition, 
if it is necessary to decorate a page as well as convey some^pata, this should 



be kept in ^lind when selecting among the range of possible frameworks and 
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specifiers. Artists often use depiction so that the framework specifier 
and/or background reinforces the- basic message* for exampler td^Present 
information about unemployment they ar fl tempted to use a line of p^opl^ 
waiting for 
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unemployment benefits as bars in a bar graph* Tempting though such decorations 
are, however, we stress that it ig far more important not to violate any of the 
operating principles. In the last Section of this chapter we present a step by 
step procedure that should prevent; an artist from doing 50, even when the dis- 
play is q unorthodox* Tn the fifty case., we have tried, this scheme proved 
adequate, and we intentionally varied the kind ^ depictions and technique^ 
used in an eftort to strain the system* Further, the system is so explicit 
that we hava written a computer" program for the APPLE computer that produces 
viola.tion-f ree charts and graphs (write the authors for more inforDation about 
the program)* Thus, we are confident i-i recommending our procedure for use in 
designing bost: displays* 
What/is the intended audience? 

— ^ 

The nature of the intended audience is important for two reasonst First, 

the concepts you explicitly label in the graph obviously must be familiar^to 

the intended readership* For example, plotting first or second derivatives and 

labelling the axis as such excludes people who have not studied calculus • The 

sane information could be presented by plotting the si ,ple' level of a variable, 
1 

and al|owing the first and second derivative to be read from the graph as slope 
and curvature respectively, which is a simpler and more accessible concept. In 
addition, no exotic words should be used in labels, nor should uncommon symbols 
be employed* Second, the graph type used is to sotne extent dependent cii the 
readership* The graph types we will discuss here are all common to most liter- 
ate people, but there are others that are leas conftaftn, with visual patterns 
that are not obviously translatable into quantitative trencFs, Fos^e^mple, in 
engineering studios there are diagrams in which information is displayed as 
blgtches whose shape represents information in a polar coordinate spaca* These 
grams are quite mterpretable to one thoroughly familiar with them, but are 
only a Pxirdrance to the rest* In many cases even the comnoa graph types we 
will -iisc^s may not be universally known, in which Case one has little choice 
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but to use ta^ ' presentations.. (In fact, for simple and small data sets, 
tables are comprehended quite we'll* even by children? Walner, 19XX; Wainer & , 
1<>:<X) * 

what is t;here to vork vith? 

The final thing to keep in mind is . cher basic: What physical materials 
are there to wort vith? Can color be used? Is the display going to be in a 
small area of a page, or on a large bulletin board? Will the display be on a 
computer graphics screen with a coarse grain? Can you vary the weight of 
lines? The size of letters? 

tfhese five general background factors must be kept in mind before one 
begins construction of a display. Only after resolving these questions can one 
intelligently proceed the next step, deciding on the structure of the dis- 
play* 

2* Choosing the correct displcj. type 

xtoving decided what data one wants to illustrate, ,/hat use the display 
vill be put to, what overall impression is to be conveyed to which readers, and 
what materials and on hand, one is now in a position to begin drawing (drawing 
taken in its broadest sense, to include displays on a CRT) * 
Charts 73 « graphs 

-"he first question that must be asked is, what is being related to what? 
That ig, are the relationships you wish to convey essentially qualitative or 
guantl tative ? Hecall that in Chapter 2 we distinguished between these t*o 
kinds of -semantics, noting that charts usuAl-ly convey information about quali- 
tative relationships (su<:h as "is a member of* or "occurs aftfjr") whereas 
grap^ al /ays convey incorrcta tlon, about quantitative information ("x has more 
tha.. y"). *We pointed out that there are a nunber o£ kin-is relationships' 
possible m both cases. 

To review, when viewed as format symbols, both charts and yra^hs convey 
intimation by placing parts o£ a ^rasiewor}: together. Charts - do so by omnac- 



ting distinct framework elements (usually boxes or nodes) with arrows or lini:s* 
The relationships symbolized can be directed or nondirected* a directed link 
is not symmetrical: for example, an organisational chart has -links labeled 
"un^.er supervision of 11 or the like whichpoint '''own* A symmetrical link, such 
as "sibling of" in a famil-' tree, is equally v^id going either way. The rela- 
tionships in charts can also be all of a single kind ( t as in a flowchart, where 
all imfcs mean "followed by") or can bo of multiple kinds* If they are of 
multiple kinds (such as would occur ^ a family tree), the different kinds of 
links r;iust be clearly distinguished and labeled* Finall.', a given part of the 
framework can be related to one other pa~t or to many other parts f depending on 
what is being discussed* For example, one-many mappings characterize hier- 
archical structures, and one-one mappings characterize flow charts. 

If the kind of data you have is of thio general type, where distinct 
entities are being related qualitatively, then^you want to draw a, chart* In 
drawing the chart, first decide on the basic structure (hierarchical tree, 

sequential stepg, -ate*)* Then consider the steps discussed in the third sec- 

* 

tion of thi3 chapter* Be sure that the important relationships to be conveyed 
take the com of easily perceived visual patterns, as will be discussed 
shortly* 

Graphs represent information by pairing an extent associated with one axis 
with a position or change of position along the other* in this case, the 
specifier serves as a function, with each relevant point along it pairing a 
point or regior. on the horizontal a^is witn a point or region on the vertical 
axis* Gt..;" s reldte two different scales together, and, depending on what kind 
of oCal£3 are b^inrj relat&d, different graph types are more or less appropri- 
ate • 

To r^vi^v Jjiiefiy, thor** are five scale typos. Nominal scales ar* no^ 
ordered -it all; nimbers or other symbols arc us^d a- lab^U (as with corrrpai ±J 
nafnes, nun;:or> on athlete's sweaters or the like), Ordinal scales are rank 
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ordered only; the actual magnitudes of differences are not reflected in the 
ordering itself (the difference between first and second may be twice as great: 
as the differences between second and third, but this will not be evident in 
this V;md ot' scale). Interval scales preserve the actual quantitative differ- 
ences between values (such as farenheit degrees), but do no t have a natural 
zero point. Thus, ratios cannot be tak^n amonj items on an interval scale; 10 
degrees farenheit is not twice as cold as 20 degrees farenheit. Ratio scales 
are like interval ones bat they do have a natural zero. Thus, not: only do 
quantitative differences among values have meaning, but so do ratios. Two 
hundred dollars is twice as much as 100 dollars. A fifth scale type, absolute 
scales, are ratio scales with non-arbitrary units: nunber of jellybeans in a 
jar for exar^ie, unlike dollars, which could be changed to different units 
(e.g., cents) vita no loss of information. 

ThMS, if you are relating variations in some quantity to something else, 
you want to use a graph, tn many cases, one of the things being considered is 
a set of nar.es (i.e.- a nominal scale - company, country, condition, etc.}; 
the ^osr frequent exception occurs when changes over time are considered, in 
which case a ratio scale (time) Bay substitute for a nominal scale/ In special 
cases the otter scales may ser e th« sane purpose. In ail these cases, one is 
comparing a n^r of things with respect to "a single scale of measurement. 
When nor-3 than on^ sc^le of measurement is involved (i.e., several non-nominal 
scries are ^apr^d onto a nominal scale), we recommend that the choice of 
separac? fr-^e^rHs versus a single f r artwork be nade according zo the follow- 
ing crit^ri^n: if the sinilari^ies or differences amorvj the non-nominal vari- 
ables a« pirt of the intended message, and if the number of such variables is 
not too l^r^, ^hen a single framework sho i JlJ be used, This allows overall 
sirul-inf/ of different or trends to be display^ as paMlUlian or various 
sor^s or n-Au'-ir-i I holism (e,^., fanning out), which can be p^rc2Wf>d as entire 
' rfIth/;<1 - nee ^ to glance back an:* forth between gn^hs. However, U 
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such cases one must: be aware that when one u*es a single framework to represent 
more than one measurement scale* it is difficult to signify how values are 
related ta specific specifiers. In many of the cases we have seen, trying zo 
include Tiore than one scale for dependent m* ^ures in a simple graph results in 
ami>^^ .o-i anJ An incomprehensible display .e.g., see the second scrapie at 
the ei;J of Chapter 2, where the twiddle framework was used as zwo scales)* Thd^ 
it; is inp^rtant to use similarity cf colcr or shading, or explicit labels for 
each liT.o and scale, so that the correspondence is apparent* As a corollary, 
if the nunber of different scales is large, or if similarities or differences 
in trends are irrelevant to the intended message, then separate graphs should 
be uced* 

Choosing the correct chart or graph type 

Charts « If one is dealing with a chart, the choice of a graph type is 
almost entirely dictated by the nature of the connections between the things 
represer*o3. If one is dealing with one-many mapr. gs, where eachthing is 
connected to two or nore others, and each of the.se in turn is connected to two 
or more other things, a hierarchical scherce is dictated, the convention is to * 
put the el^ents of the framework (boxes, nodes, depictions) such that the 
el^rent.^ at the "dominating 11 e.nd of their relationships with other elements 
f "dominating 11 neanir/j "supervising" or "including" or the like) are^.higher on 
the page. If you are dealing with one-one mappings, the nature of the speci- 
fier^ di^a^es the framework again, with temporal sequencing requiring a left- 
to-ri^'ht organization in this culture* The constraints on chart construction 
thU2 ^n-^> not jo much fr^m th<? general nature of the specifier and £ra:;tework 
ay from th»i vfer-itmg principles, especially those th.it proscribe violating thq 
limit^.i Lr-^o^i^ir,^ capacities of human readers. The^e principle? <ire lsvrsrp- 
or-^t^d :« tne -;p*ci tii 1 n^*:r J ic~ i to fallow, 

r - ro^lovinj s^Cil^s v*? will eonsiJ-jr when *ioh c? tsw ftv^ 
mo<it -ripi tyc^i is appropriate for each typt' of dat-i, The grripr- typ^s 
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we consider all are in conmon use and are relatively general purpose. Let us 

first describe these alternative graph typos: 

A Pie graph consists of a circular framework which is divided into a set 

of wedges. Each wedge represents a percentage of the whole, as indicated by 

O 

its relative area* 

, h divided bar graph is like a pie graph, but the framework is rectangular. 
In addition to having the internal area divided vertically into a set of 
snaller rectangles, each above she other, in such graphs it is common to have a 
scale marked along the left side of the framework. 

ft line graph usually occurs in an H L M shaped framework (which is so,neti^es 
closed into a rectangle), with the scale of the dependent variable (i.e., thing 
measured) associated vith the vertical axis and the scale of the independent 
variable (things that measures were taken of) associated with the horizontal 
axis. A line (or lines) serves as a specifier, providing specific values, 
differences, and trends of the dependent variable for specific values^ pairs of 
values, ar,i ranges of the independent variable. The height of the'speci fier 
line over a value of the independent variable corresponds to the value on the 
vertical seal* at that height; the shape and slope of the line as a whole cor- 
respond to the difference or trend of the dependent variable paired with the 
range of the indopeTtent variable that the trend beirvj examined sits over. 
Lines are continuous, representing each point on the x axis. 

In general, a bar graph is like a line graph except that bars usually 
stand ?r tho labeled locations on the x txis. The height of the bar indi^ 
cat*,; tnat the values at that height on the vertical axis should be assigned t- 
whatever j.5 la^el^d under the bar. Etar graphs can be constructed wich the bars 
bevnj vert:-^I or horizontal; when bars are horizontal, the 5 cale of the 
deper. \*>r^ "mti -ml* now is on the horizontal axis. 
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ft surface graph is simply a bar graph in which the bars are so wide that 
they are connected, flowing horizontally into one another. As in a bar graph, 
the area within each bar is often shaded* 

We have chosen not to treat one last common graph type as a distinct 
class* Picto^rans are siraply bar graphs in w v * ^h the bar is replaced by a 
stick or identical pictures* Usually each picture represents sone fi^oi nunber 
of the units of ^ea^urenient (e.g. , each barrel nay stand for 1,000 barrels of 
oil produced)* Those graphs function pust as do normal bar graphs, with the 
height of the bars indicating the value of the particular thing being measured; 
the number of picture is completely redundant with height* The only cases in 
wh^ch this is not trv:e involve the unit picture being assigned a value in a key 
and no vertical axis is included* It is conceivable that there are special 
circumstances in vhiih this is a desirable feature, but it is not apparent 
what are the general principles that will identify such situations* Thus/ 
given that depictions c^n also be used for all of the other graphic constitu- 
ents, we did not consider this one yase sufficiently different frora standard 
bar gracr.a to warrant a separate category* Rather, pictograms result when 
depictions are used pragmatically to reinforce the point of the graph, or to 
convey ab_ioLut^ anounts by allowing the reader to count symbols* 

Fitull'/, we have not discussed location graphs , which usually consist of a 
map with difrer^nt ^ynboli> over different locations {thu 3/nbols represent 
thirv^ lik- the population t>r temperature at the locations). These graphs c *re 
not ^en^ET^l, but are used only to map values of a dependent variable to speci- 
fic lo^iti.ns* They do not function is d:> the noro general $raph typ^, with 
eich ^pitiil im^niion standing for a different nonspatial or con^pttnl 
dimension. In^*>Ml, each spatial ^location on the graph ^pre^entr, a spitial 
locjii:>n in r*w worM, Ih^T, tfrwse gr^;h^ are in fact mi.fnanedi th^y ar'; 



PMI'V 1 



T*/ ~i*7 + r witn partio-iiir i nforni^ir.n quppl vi in a^Ution t'> 



i ">Ca " I 



i~ jy 'ion^ in road r^iog lnlic-tting height -y population d^it^if/ 



277 




£>y different colurj) . In tn* r s book, we have explicitly excluded discussion of 
maps, foe the reasons discussed in Chapter 2* 



scale o: -easur^mont (the "dependent variable") and the nature of the thinqs 
Ming -j^.ipir-?d (the "independent variable")* But these are not the solo deter- 
-un ir.t c ; . In iJi:,tion, properties of our: perceptual and memorial systens feiv^r 
so^ ;rapn types over others for specific purposes* Thus, we must consider two 
thmj3 when choosing a graph typ^: The nature o£ the data, and the purpose to 
which they will be put* 

Let us now consider in more detail ho* T different factors affect the choice 
of a gr^rh type. First, the five types of scales can be further divided such 
that prep >rtior. an! ^er^ent^ge are differentiated from other types of amount. 
If this is ione, we nave/ four classes of measures: Ranks, proportions, inter- 
vals, and ratio scales* We must consider the appropriateness of each framework 
type cini specifier type for each kind of measure when the itens arranged alonj 
the x *s"al"-- are thenselves ordered on a noninal, rank, interval or ratio scale* 
Th :s, we hjvj £o^r possible measurement scale types and four possible indepen- 
dent varr ifllo scale types, resulting in T6 unique pairings* But this is not 
all there L:i to it* sad to say. We mu^t a I so take into account the purr--,e of 
the ii>-lty, vhic;. of --en will the deci iing factor when multiple options aro 
te^hr.i^ I r . / ipe r r ; r . jZ-i . Feci 11 th^it we knu,/ that people hav? a diffioulr f.ir" 

i - jl r, r I ? i jri of a perceptual unit and ccvanng it to another unit or 
j.irrt tr.-*t':-r ;ee 'Jrnpter 3)* Th T i c ;, line and £?urfvj«_* .jr.ip::^ ar*^ to be avoi I* 1 
wr.tfn >-^*i?i a . r r jint i nforn-ition is beinj -j-jnvey-'- 2 . Oi the oUiot h_i:;_I, re:;/l * 
httv*? i 1. ui- i :ii'r:iv/ fo^ appreh^-n ; inf ^rr.i*:ion in*l nakin J con j. infers . ~* 
T.j,, r tr }+i\v*, ir*- to b*i avoids 3 I! 4 ' rvJreroe:; {nor*' thin i ^\r) poi'it; ire 
j lf .* . T -r^, - r ^ ^ n ♦ iij-*r jf pursue o~ r,- c.- ,Jf r v r ; * -n , ti^ b^ ni<:-' ( Lis * j 
;r ip. . v_r i , ciii u ; J i L Ly ne onc'j J rc : ,e i <j£ r- la*"3 v-*ly re-. 1 

u .;:<-r- r :^r .-w^hL nmti 'we 4 , u^wirt r:u;,j lir*-'. ,f L'*' shape ;, ^t 1 .), 



ra^or dot*/r.unants of the best graph type to use are the nature of the 
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vnieh nLni;n:^9 problems duo to memory capacity limits, and allows abstract 
patterns or inference*; or trends to be depicted as single visual properti'":* 
In addition, to considering which fornats are least taxing for a specific pur- 
ple, tti't re ider rfili hv/e to con^id^r which form/its are easiest to modify to 
^.phasize i p^rtirular point. 

Ran/-; Jit.k ciinot presented in a pie jraph or a divided bar graph (wh: z:\ 
i; roil:/ i rectangular pie with rectangular, stacked slices). Of tho 

renaming graph types, the one chosen will depend in part on which scale is 
us*d alor.j the x axis, as noted below: 

Ho-nr.il scale fjon^ the x axis : If itens along the x axis have no speci- 
fic ranking, bar grapas are in general the raost appropriate grap\ types, If a 
bar grapr. i^ cho;er., horizontal bars :nay be usei if the:v is no inherent order- 
ing a^cr.g the rea^urei things (although vertical bars taay be preferable because 
of their famliarlty alone); if the things are ordered in some way, a vertical 
f^rnat is preferable, with the bars being ordered leCt-to-right along the x 
axi=;. It is or ton a good iiea to or^Q r entities fron the greatest to piaalle<st 
a\or^ one* ;f the scales used. As XXX (l9XX).has pointed out, usually the reider 
^ ia aa **x? " i !- ion as to the ordor of entities, so that mere presence of an 
i ten in in ;::-*xr°cted ;?oi-;i tion i tseif; conveys information* Furthernore dicco^i- 
ti-iit-i; in the j'? ja-fn^ (e.g., if the weal^fuest 20 countries are far ricn^r 
t-ir, vv- rer Mjj, will he apparent in a larg»- step at " e point of discon- 
ti-i-i l* /* w*h'*'\ 1 di-scor.titvu ti^s (?'<i-jt f and the a-rti-t wishes b7-w;My;ir^ 1 

-J.'^, * - ■ *" -r ir h a bir jrrv. n-^ cp ic-; between the bir^) 1" 

y 

' J' 1 *" ^- • r ; * m :■■ one nigl*'- ne t^ppt-M to e^rrtt 1 in*- ;;r iph *h'"i 

*"' "\ r •,-".*. ; .» t nuu! >''il'r, rea>.r:.iM thii* i hn" jraj * >-s t*h" 

r "*-r :->' . IT'*'.- i f > 1 1 > lh-"i r in-^ ^ontiviri . 'ioil^, wi ,> in' .^r; ->! tb ■ *• 
-r*"f *'. * Vi, I--:, r-'pr^^'/'-i by >no :<-«i/£n. M,ii*y -sMn.hr h 1 .e 'jrr^ 
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r^v^ni T A: - o :>nv jn*- l vi , Ho,viv*/r, th**r*. j ir-f tvo oi^'uni'.anT'ij in vhnU * 
lir^ }ri: T :, ;uspr: .1:1 ;Ly, appropriate Fir^t, 11" the mnb^r of x v*iino<3 i< 
snail, in; :::-r j ar^ ;-^r.il lep^n.tent vir-;-: nh.it b^hnve in -lift"rviit wxyr 
vitr, r**;i»:z O -^n^-r^ -t, .1 nultipl^ lir*; -jr tph will tran;l-a^ thr - 

fc f'jro:: :» j ; ir.t^ ;aL:-:ii nonpars 1 hnv; , Thi - 1 11^^-3 th- rtiu.i^r to .j * ^" 
ti.o 'ill t ;r-^^i -K-oiinj th * nonp.tr a 1 1^ li-n lir^-'rlv, vitrivnt: havinj 
oxi^in^ -,:.*f jl"4 : r. Mr-by-bar Chapter The je:on.i ca^ wh>*r>z a Isn-? 

;jrapn 13 -anla^:: :or rr^nirial Idia occir j *?h*?:i a) the graph format ir? ?t\niir: 
iced, so th-* i^iz-^z-ri'iW. or-i^r or noninil <scil- valj>?: dl.r.-j ;< is 
non- jroi tr 3r/ m t.v; >ja^ that it i<> the v n^e for every one, ani d) various 
pattern; of v^l'^j of v*'* y-*calo with respect to the ;< nu*>t be dif £'?r -*;**t L^t-^ 
In th^s*: :m> ;, 1 1l t >> araph .iltovrs each, pittorn to be repr*i sente*! as o lu. ■* 
witri a uirvf jnt, universally recogruzab'.^ C3ntj>ur- — again sparing tSi*; r^a^r 
fro^ hr/m; to imiertavtr a cogmtively cost*/ elenent-by-^lort^nt conpari^on* 
Qn<5 not^bl? exa^pl^ is the Minnesota Multiphasic Personality Inventory, * 
persona Lity t^-; ti " >r> .>nro yiel -Utv; nine val'wj u^u 1 1 ly lynty] * a I: n f - 

gripri cul-i a "prori^lV, In th^ lore of rt"Pl m*erpr«e"ation/ a line olopmj 
up t^ th*"* > j ft r^:, r»* sen ts a neurotic personality/ a lin^ ^lopxrvj >ip to the 
n pit r**pr";*inv a psy/noti t personality, an i a^on 5 ta^ rtorv ^v-u~t pvy 



t t-^r:i j ; i- U 'i-'-", ban *n *■*, tn^ fol 1 r. ; i : l >n 1 1 ^ ^ L » l *■ i-,r. , : 
'■' : ;>r J r *'j i rJi ; -r 4- *:^ t? K *- ,ri v,T "i--j u.-- :i"y 
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A r-lr^l 'li-; involv-o truly irre.i i nhl/ ordinal r>^Al^s, su -ft a * ran 1 -: in 
a -:i or tiru^hi:*^ ord-^r in a race* In the ;*> cisos, a bar or surface jr^ph 

i^ pr'-*:*"i a; I-* to -i jr a: h, all other thxn^> beinq equal, f>inc*; zh*j lat* >r 

c , i rr** I**v irt*- -r non^xut^nr >: -contini ia^ against which t?*m d \ in y 

-My » * ';^ii;lly "\v-iii. Hov^ver, as .ith nomnal scjle^ ordinal scale? -iy 

o** ro.r- 11 /",: i vr.;, line ;r iuh<; if tne iifr^r'-nc^ between tw:> ran-: ordering, is 
b°ir. i ?o *r • i, that i 1 1 1 *r -jnc*a.5 m ran^irs^o can perwiv^! as patterns oH 
i « It ;e* 1 ^ re i^onabl-^ to a^sn^e that reaa^rs will be as fata enough to 
rt i i^ i li\e ;ra; * r -p r e -;entxn<j in ordinal scale and not change their concjpV."i 
or" the sciLe ^ ft revilt ,o.^, it unlikely that they will think that 
fir.:r>n,';; or kT in a ric- is an interval scale just because they hive seen it 
r tip re ;*?nte i in t line jr *rrd ) 

In irorn^r ^la o: case:, ordinal data are used to sanple or exenplify 
portions jr whtt ^ conceptually an interval scale* For exanple, one night 
want to lllu^trtte that lar je populations ar>i associated with high m^ant mor- 
tality rit"->. In tni3 case, countries would Corn an orlinal seal** if ord^r^J 
oy po» jlitit-, bit it Is not the identity of tne countries per se (as in a 
n~M: il ;ttl^) ;sr the order per se (as in a race; that is conceptually mror- 
t*tt* It ii th" ; /; ilitijii -sral*'* itself tmt conceptually an interval 
s~il>, i:. j i *h *■ tv/v+H m or ier tor 5 reader to conceive of ' thv 
'it i i j i Lwi. r d it' . r r b+ * , j i ' d f - e* * pr '""."i o i nt r i r 1 

( i ^ r* ■ - t ; - y; ■■ 1 1 o a " i r /\rJ r : , wi tn in t**r rudi r_" va In',"* coulJ h^ n >*d 
* J " ^ i' - ■ t * - > " , . Tn , i -a ; ■ ; , th'-r*' * ; ! ■ 1 s>- n , T' * . - i-o'. t * n ;^ Li r**" 1 
' - , " * . ' ■ * - r r ^n ■/ i 4 ,r ii;. zri " y no »t L ipr r r .vmio ^ , T in t»*rv x 1 r 1 1 : 

■ i ^ 

* ■ ■ i" , i 1 h a f pr : t*- f ,r * h- -:i r i , --1 d i* i. N^* h^^-v-* , tlv 
■ * ' > ■ * ?*o. " r * r y r ,f 1 ■ ■ ■ r v 1 1 i, T r i ; , \j : , \ r- : - , i * 3 ^ , ■• > ■ L ■ : 
■ T r ' 1 1 ^ i*" j > 7:,*' * i r* i" 1 '-, m j n *■ ^ :j ^ ;raj n * y m t ■ 

■ 1 - ' ' T - ) ,f ■ , whi ' ii ^ 1 r/ wi 1 1 i*t* pn*- * I£ j r, 4 « t m; ^r- it p.h -jr 
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th*i i.^o*: — ,f i:^-?r t.^' is important, a bar format is mor^ aooropfi.it"; if 
1 or tren i lar or/i i t_i^n m important, or compir Liom a™?. 511*7 slopes, diL ler~ 

tr^i>, 1 I in** ^jrapn is ^on- appropr iat*j * 
rT :'.fr, t I" ';r or": r " 1 crs lata 

A. Jir S'^ku*! 1 c ^ e * Wtu*n thi : scale 11 the inJe. ,dont variably 
;-ir*:enCi'j*» pra^ti^n lata con be qnp l tei u^inj three different tyraph r Vl"~* 
Th-? vjs: .^^"'in f ^r^r. 1^ a pie qraph, wi -^i the relative area of si 1 ■-*'•'> repre- 
sentor; j tr>: pror or ti :>ns ■>£ th*-* representee quart ti ties . Thui format ha 3 two 
inpor t i'.t li ^: : .n 01, >, however * Fi r s t, precise conp ir isons cannot be made in 
no^t ca >**s beci'i'.o it is v*-ry difficult to :>ta;-ure on th«-; framework ap| r> 
prratel/ '^::r t ^ 1 Lr jtrac tor (thouqh a series of t j.c 1 -' narks arrayed around the 
cl rc^rtjrer^ri of t;- pi- can help). Second, no more thaii four or fivt= slic-^ 

"present:-;; ^_»os 1 r eleven t entities shoili be u^ed. Thu«; f if a nunber 
dif f eront n is <x tnir.qs (i.e., different specifiers for each independent 
vari^bl^J are prfi, a nuLtioLe franewyrk display or one of the £ Llowma 
in:-; ty ; e , will be r^rnred. 

Tn** ;e^;o;, i >/ 1 jn is a divided bar <jroph. Thi:, forrut is a kind of spjir^ 
pi^ 7rv^., wi*:. t;:r,' l^r::th of each bar r^presentin-j the proportion. A scale is 
c-^jr.l, ;! : :■ : ;an be '11 5 lead mq if r A i„ib^rS we associated with it. Trw 

h^i r : :r i not repre-^nt the pr^porti o^, a*; one could mi rAab- nly 



wnr.sT fiovp;; 7.1 hit:; 

i of tra ; r^^nat 1*? f,n it i/re"!*!*' valuer c i t 1 bt> ropre -•Tit" 1 
r rn;:i'»v<;r.< jn 1 can a 1 Lov *n** ro re 20 sp*'*iid; Vri an 1 cj'.- 

" • 'MmI/) an i f ~ T i 1 t 1 : !_*_• 11 i" iL tn S ~- ri^pr 1 1 r 

r ':,/ iiff^r^nr bt"w. T.'i Hii ; format ha * ah >>r 1 Mi- > 
■ r i i * / ■ . * m ji'/'-a t h < ■ p.* jj !'> can pr" : ■ ;b u + f 
" • > ,' my iii f "rnil ii i 1 .-a t i"» mii tr d,"1 m .i ; i" 
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Thu ^vri npM >n with t:h*:"n j data is a bar or surface graph, with porvnt- 
ajes heir.; r»:pre ,-uir.ed along the vertical or horizontal axis* In this ca^^, 
v . wever, tr.- reader will ao^ how the wnol^ is divided up, with the quanti- 

ties jf trie va r i ou 1 : entities neces^a r ily in exact invert propor ti*' n . All tha t 
'Til be -ir-' differences a;aong parts* 

B* ^Jjtllli "a le plug ^mother r.cale tlong k dxi^ * Matters becon<- no r e 
ecnplex *-rien proportions among a set of normal va]ues are th^n contrasted at 
j^v^ril levels of an additional independent v^riabl^ Figure 7*1 shovj the 
three li:£ert;nt ways of graphing the sa;:;e data when two independent variables 
are con :-»red, ttote that in the pie graph you can see relative arounts easily 
at a s:^]l2 level >f the variable distinguishing the different pies (tm^, in 
this case), but Lt is difficult to compare actual amounts across the multiple 
fra^ewor< J >. This kmd of comparison is easier with the divided bar graphs, but 
tne relations t no:: ; the individual conponents are not as transparent, In the 
final case, where we have separate bars for the two kinds of soap, the trends 
in percent-*^ of people using then are visible, but their status as propor- 
tions, witn an increasing share of one entity necessarily eating into the share 
of j^v.t, ha; no direct counterpart in the visual description of the graph* 
Tnus " J r i " -lultiple iivi led bar graphs seen be^t when proportions of nominal 
scale vili-rs are co-tr red jv-^r a second noninal scale, or a truly ordinal 
i Iv i ^ i J- 1 ab ,ve ) * 

A ~!'>.e to ii*Ml ^nso for proportions varying over sorv-> intetv.il or ratio 
v il^ i> -t.j*. w^- •-fin oall t " luw/'i;'/L I bar graph"* Here the relative wi Iths 
o: -sr "hit partitions Uu* area of a rectangle horizontally c^n cnan jo 

:;iu , j »Ly rrurc right to Lett !iee Figure In thi : ease, change m pro- 

:^>rti ov-r \ i v: translate** mo tapering l'*;t, tap^ms right, bulling, or 
*a:r."- r -* : , - jr;**nt an i th^ reciprocity of th^ proportion; of difl* 1 
>r>p->,.- ' i giv->r. ni^ i, evi J-tU f on th-- faot tnat ill tj^ vnt'i must b-> 
^ ; I'i^-* -i ; ■ ^ ) 4 r»- "anjl-! of un-e^n 71 r.g bei jnt * On-* poO-nU-i I probb\.< wi th 
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zni3 fornat t 3 that the ^lop^ of segment^ though per 'joptua 1 ly aval labie, i? 
concept'Ml I / irrelevant, and may even interfere because the slop** is percep- 
tually iai^«Jral with the conceptually relevant dmension of segment width (soe 
Chapter* 3 inl way to counteract thi-< problem is to stac); the segment; 

m a top-" ^-bg t ^ ^rd^r that reduce overall slopes ani emphasize wi dth -- 
v hus Flair-* u i superior to ""he saw Lnfor^itnn lj raphe i in a different 

;>rder in X( ,o) , 
4. Selecti na the axe^ 

thii point/ the r^aier ->houl^ L-j >jn\v to select a graph type for a set 
of ia^a. The une->ti^n now become'; which of a nurnbfr of independent variables 
shouli be placed on th^ :< axis. That is, in nany graphs theirs are multiple 
specifwra. Each ^pe< % ifior is 3^ be led, and in fact the entire set of these 
labels coul \ ;u:>t easily have been placed on the x axis, with the labels 

% 

originally iayed out along the x axis now being paired with individual speci- 
fiers, F: ;are "\2 illustrates such a case. Once again/ the choice of layout 
dep^r/ia on th-* parpjae of the display. There are four rule^ of thupb here-i 

XNSr^T FIGVPS, 7,2 llV&L 
Firat, the designer should decide wnieh independent variable {tines, con- 
dit'.on:;, y*'ir; f ot<; J is corapojod of ^ntitiet; are to be contrasted ;;ith on^ 
another {lot us cill thi^ the "foreground variable") and which triable or 
entitle, ir h t >t>' n > a b-*cMrop for tho coiiparuson of interest, "ba.aadrop" 

the of iorviaq as i -jet of occasions for th*j contrast within the fir;>t 

ir-^pMi v a* van ihb T to bo nad*> rep^a t'^d ly . For example, in Figure ?*2a, ttw • 
i:i:,r"i"i a i- ni* illy c iunn ir i/a>d as "an r»J th^ T J,3. wa.> much mor*; prohib- 
it v> tn i: L w \ t ijj , wh'*r :aa in 1 Oh J tno 'J* S . w r> only sorwvha t more pr odac ti ve , " 
Her'* w * "or- -jro iad oon* raat l*j b^t.;-*^n coun* ri^a, and it m.i . twi cyT 
w t ; ' i ir ip r)f 1 1 \ \ o t'^n t year : . H ■ v* *r * 7, ^b, cont- unin'j th> ;a.^ 

:at^-nn a-, i; . tiTjri^'l *n " , on* <a "/) ty d*^ Ut;** i betw^^n ml 
l^^'b ^a-T i j ; v nf:rea;^d" , wi vn u > a^ forejr^uni cotitr ill, ar.d 
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countries a^ the backdrop* The ruin of thumb seems to be that the fore^rouni 
variable sho\:lu" be draw as the parameter (with specif .art3 la'.el^d by the 
individual values), and the background variable as the abscissa* 

Second, the designer should consider which contrast in trends it; to be 
^-pna^i. , wh: rh may not be the ^ame thing as deciding which variable fore- 
ground zni which variable is background- If a clear contrast is intended, the 
# 

variables should be assigned to specifiers so that the relevant contrast 
appears as a recognizable shape. For exaoplj, 7,2a seems far easier to rec* 1 
than 7*2b, perhaps because the narrowing of the productivity difference, the 
grapn's intended message here, cone^ through as a converging pair of lines, 
whereas in 7,2b it co.nos through as a difference in slopes, which does not 
connote a "narrowing" of di t f erences as sa liently* 

Our thiri rule of thumb, is straightforward; r ^an interval scale is highly 
suiced to a continuous axis, whereas countries, a nominal scale, ijs more suited 
to a set of discrete specifiers* 



< 



Thir5, all other things being equal, *juf finally, when one independent vari- 



able his a snaller nurcber of levels than the other, the smaller should Comprise 
th^ par letter (labeling individual specifiers) and the larger the abscissa., so 
an to reduce number of visual units in the graph, and to allow" the Comply: 

co/ipariso'i" within the nulti- value 1 variable to translate into the shape of a 
1 1 1 rather than the d if f'jrenc^ among a .set of endpomts (compare Figures 7 * \i 

INSIST FIGURE V. 3 HHRB 
^h>j je t'Vir heurVitics can often conflict if applied outside any particular 

f 

jor^.<'.. V-'^'^^i theie^conf 1 1 ct * can usually be 



resolved if one consider- the 



p->i*t y\tr^iry^ convey and the ^ to which the inform ition wiU bo pu^, 

I: r i^r: -r i carina resoLw a conflict, it is likely that any arbitrary 

^ p'i..;/! •) vili u /atis t acfory. 
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5* One framework or many? 

The final general question that must be settled before a display is gener- 
ated is straightforward: Should the information be presented in a single 
framework or in multiple frameworks? ^e main factor governing this decision 
is complexity; There shou 1 not be tou many perceptual units within a single 
framework* Thus, if there are more than four functions bein- plotted in a line 
graph, or one bar per point in a bar graph, or fiva slices in a pie chart/ or 
one surface in a surface graph/'then a multiple framework may be appropriate* 
One exception to this rule occurs when the Gestalt laws group sets of lines 
into higher-order units; when all or almost all the functions are similar and 
the point of the display is to emphasize this fact, all the lines should bo 
plotted in a single framework* Similarly, when the functions fall into two 
groups, with the functions wiichin each group being similar to one another, a 
single framework may be used* 

When multiple frameworks ars used, the designer must decide what will go 
in each framework* The selection of which independent variables to Put in 
which fr< reworks sbouid be governed by their similarity and their relevance for 
each other* That is, similar categories should be put in the same framework 
(if only to make it easier to comprehend what is there) and variables that will 
be compared together should be placed in close physical proximity so as to 
constitute higher-order shape patterns which can be perceived as units, this 
signifying a trend or pattern Q f trends directly* 

Finally, multiple frameworks often seem appropriate when several different 
dependent variables, measured in different kinds of units, are related to the 
same independent variable and are meant to be compared with each other* The 
reason chat multiple frameworks are generally appropriate is that the different 
sets of scale units along the y-axis are not easily linked with the appropriate 
specifiers within the graph* There is one have obvious exception to this rule; 
when the number of specifiers is two, and they can be clearly related to two 



vertical axes in a U-shaped framework by arrows or perceptual similarity (e.g., 
when the lines are the same color and boldness as their respective axes), then 
placing them on the same graph yields the advantages mentioned earlier; dif- 
ferences in the respective trends can be seen easily, as the nonparallel shape 
formed by the two lines* 

when two or more frameworks are used to display dependent variables 
measured in different scale units, the measurements are in fact scaled arbi- 
trarily with respect to each other, and thus the different frameworks need not 
be the same i^e* For example, one may want to compare numb.-: of suicides per 
year with the rising cost of food* Making both frameworks the same size allows 
for easy comparisons, but making one bigger emphasises the point that suicides 
have been risin?, as is illustrated in Figure 7*4* 

INSERT FIGURE 7.4 HERE 

3* Guidelines to drawing 

The following guidelines should be used when drawing a chart or a graph* 
However, many of th* specific pointers a re only relevant to graphs, which can 
simply be ignored when one is drawing a chart* in addition, we often use ter- 
minology specific to graphs per se, as was done in Chapter 5; the reader should 
realise that "axes" refer to a part of the framework, and usually correspond to 
"box" or "node 1 * if one is drawing a chart instead* 
Drawing a multiple framework display 

When laying out the separate subgraphs in a multiple framework display, 
the following guidelines should be obeyed* jf you don't need a multiple frame- 
work display, skip this section* 

1* The physical arrangement should lead the reader to examine the displays 
in a logical sequence* Readers in our culture will examine displays left to 
right and top to bottom* If a particular order is critically important, sub- 
displays can literally be connected by arrows indicating order-of-Inspection* 
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2* The relative visual salience (reflected by differences in line weight, 
color, and si2e) should reflect the relative importance of the information 
presented in each display* Hake the more important sub-display bigger or in 



some other way visually striking; if no display \p more important, make thstm 
all equal si2e and equally salient* 

3* The individual subgraphs should be clearly labeled* The labels should 
be closest to the appropriate subgraph, such that they are clearly associated 
with the correct display* 

4* If the same specifier elements are used in more than two subgraphs, use 
a legend to supplement direct labelling* Make sure that labels in the legend 
are clearly associated with the appropriate specifier elements* In this case, 
pair each label with a small segment of the relevant specifier element* Make 
sure the specifier elements are highly discriminable* Do not have more than 4 
labels in a single legend* (If you need more, be sure to label some of the 
specifiers directly, even though there may be redundancy*) 

5* If the same variable is discussed in two or more subgraphs, make sure 
the subgraphs have the same general form, with the variables being presented in 
corresponding locations on the framework (for example, two pie charts of the 
same data at different times should have data presented in corresponding 
"slices")* 

6* if the same variable is discussed in two or more subgraphs, the same 
units ought to be used along the framework* Unless there are extreme range 
differences (e*g*, orders of magnitude), the units ought to be laid out using 
the same number ot ticks per centimeter, starting at the same baseline* 

7* If one subgraph presents a second version of the same information 
presented in another subgraph, this should be clearly specified m the titles 
or by arrows showing the correspondence* if arrows or other visual means are 
used to establish the correspondence (e*g*, a drawing of a magnifying glass), 
you must be sure that it is clear how one subgraph relates to another, even if 
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additional labels are needed to specify the relationship (e.g., "When the years 
1980 and 1^81 are examined in detail")* 

8. The relationships between the subgraphs should be unambigucisly speci- 
fied by title** and/or by other visual means associated with the subgraphs by 
tht jstalt i*. ^s . 

9. TO the extent that subgraphs erform different functions, they should 
look sufficiently different so the reader will not assume they are showing the 
same thing (e.g., the scales should be labeled using different types of font). 

After deciding on the graph type for each of the subgraphs in a multiple 
display and deciding how they will be organized on the page, each individual 
display should be drawn according to the following guidelines. In this case, 
however, one should also keep in mind the general guidelines just provided, 
varying font or keeping it constant as is desired to emphasize differences and 
similarities in the information conveyed in the different subdisplays. 
Drawing the Framework 

The outer framework is the first, thing to draw. When doing so, keep the 
following rules in mind; 

1* The marks that define the outer framework must be grouped together by 
-e Gestalt principles so that the framework is clearly defined. Every neces- 
sary part must be present or obviously implied. 

2. If tick marks are used between scale values, tiiere should be no more 
than five before a heavier tick mark or a new scale value. 

3. Trie marks must be congruent with the idea being conveyed. Thus, an 
ordinal or nominal scale must be clearly demarcated. 

4. If the framework depicts, the depiction must be representative of the 
class of things it stands for. Further, the depiction must be chosen and drawn 
so as to be unambiguous. 
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5* Ideally, all parts of the framework should play a role in comiunicat- 
ing quantitative information* If for some reason (e.g., you use a particular 
depiction) they do not, make the superfluous parts lighter than Uie rest of the 
framework or clearly set aside. 

6. The axes should be unifor i continuous; if they are nc ; sure 
you are distorting things to make a particular point, in a way that the reader 
can detect and understand. 

If readers are expected to extract precise information, an inner framework 
is useful. The inner framework should be chosen after the specifier elements 
are in place. This is because you do not want the placement of inner framework 
elements to group with the specifier elements; choosing the inner framework 
after the specifier is in plac# allows one to avoid this pitfall, we will thus 
discuss construction of the inner framework after discussing construction of 
the specifier. 
Labeling the Framework 

Before you can put in the specifier, you need to label the framework. 
This is critical in a graph where you need to know what each axis represents 
and how the scale is constructed on each axis. 

1. Put on a title. The titl^ should state clearly what is being graphed 
or represented. The title should be recognisable as such because it is clearly 
set off from the rest of the graph; it should not be close enough to any line 
to be grouped perceptually with it. A larger font size will also prevent the 
title from being perceptually grouped with other labels or parts. 

2. Label each axis. The labels should be placed closer to the axis they 
label than to anything else, ensuring that they will be grouped perceptually 
with the right axis. Labels parallel to their axes are a good choice in com- 
plex displays because the Gestalt law of common fate will group label and axis 
together. 
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3* Put scale values on both scales* make sure they are closer to the 
correct tick mark than to anything else* 

4* Hake sure ali labels are legible an<3 will remain legible if the figure 
is reduced* (Chapter 3 presents a way of computing this beforehand*) Use 
arabic numbers/ not roman numerals/ anC ^vcid italics (Wright, in press)* 

5* if you use depictions as labels/ make sure the pictures clearly stand 
for what you want to label* A quick way to test this is to ask several people 
to provide the first name that comes to mind when they see the picture; this 
name should be the label you have in mind* 

6* Use words that are consistent with the text in which the display will 
be embedded and with common usage about the topic* 

7* Keep the graph as close to the text as possible* 
Drawing the specifier 

1* Make sure that the specifier elements are easily seen* Relevant dif- 
ferences in values must be discriminable even after photoreduction has taken 
place* 

2* Make sure different specifiers are clearly discriminable* Different 
shading should be usei with different bars, pie-wedges, and surfaces (but see 
below), and different widths and patterns should be used with lines (but see 
below)* If more than one line is present/ make sure all the segments of each 
line clearly are grouped together? this requires having different lines drawn 
in different widths, patterns and/or colors, such that the segments of any 
given line are more similar to each other than to anything else in the dis- 
play* 

3* In order not to mislead the viewer/ do not vary irrelevant integral 
dimensions (e*g*, height and width of a bar)* 

4* If shading is used, make sure differences in shading line up with the 
values being represented* The lightest (''unfilled") regions represent "less/" 
and darkest ("mo<st filled") regions represent *more"* Similarly, i n a divided 
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bar graph, shading of bars should proceed from lightest (unfilled) to darkest 
(filled) going from top to bottom* 

5* Avoid unnecessary depictions incorporating the specifier* If such 
decoration is irresistable to you, make sure it is representative of the f~*uit 
of tht. display* Also, make sure that the role of the lines as a specifiei. is 
not lost in their role as a picture* 

6* If specifier elements abut, make sure it doesn't look like they over- 
lapping; have a sharp line between them* If they overlap, make soiae of the 
lower one protrude from under to top one* 

7* Make sure there is a visible change in the specifier element every 
l.ime it represents something different* If a single line is used over a period 
of trials until a treatment is added, for example, make sure this point is 
marked somehow* Every meaningful difference should be clearly indicated by a 
perceptible difference in the marks, and vice-versa* 

8* If color is used, be *ure that the most important specifier element 
stands out the most; if no one element is more important, avoid using hues of 
different intensities or saturations for the different elements* 

9* If color is used, do not use values from the entire color scale to 
represent quantitative values (colors don't fall perceptually along a single 
continuum)* If colors are used as a scale, use variations of saturation, or if 
necessary, variations within the red-orange-yellow (in that order for low to 
high) family or the green-blue-violet family (as ordered); these variations do 
somewhat fall into a continuum perceptually* 

10* If color is used, make sure that values of color do not contradict 
cultural conventions (red is hot, green for safe, etc*)* 

11* If 3-D perspective is used, remember that volumes and areas arc not 
accurately read; avoid perspective effects if you want to convey precise 
values* Also avoid sharply oblique viewing perspectives, which distort quanti- 



292 



2QJ 



ties, or extra lines that turn 2-D surfaces into 3-D solid volumes if the extra 
lines have the potential to distract or group with other specifiers* 
The specifier in relation to the framework 

when absolute values are to be communicated; 

1, i\ specifier lines must be no thx + jr than the level of precision of 
the tick marks on the axes* In addition, include an inner framework, consist- 
ing of a grid pattern, as is discussed below. 

2. Keep specifier elements within the framework. If you must hav^ them 
extend beyond (perhaps to emphasize a point), remember that actual quantitative 
information will be difficult to extract* 

3* If the x axis is more than twice as long as the y axis, include a 
second y axis on the right of the framework* 
Labeling the specifiers 

1* Try to avoid using a key or legend* It is better to have the labels 
directly associated with the specifier elements* Ideally, the label should be 
closer to the appropriate specifier element than to anything else, allowing the 
Cestalt principle of proximity to provide grouping* if this is not possible, 
try having the label at the end of the line, in the wedge, in a bar or in a 
surface* If this cannot be done, connect the label with the relevant specifier 
element with an arrow* A key should be used only when a) there are too many 
specifier elements in too cramped a space or b) the same elements occur in more 
than two subgraphs in a multiple framework display. Even then, redundant 
direct labelling is helpful* if a key is used, put it at the top right, within 
the outer framework, of only a single framework is used, or prominantly above 
and in the middle of the display if multiple frameworks arc used. 

2* All labels must be legible, even at reduced sizes; as before, we urge 
avoiding italics and roman numerals* 
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3. There should be no more than four labels in a key* If there are, use 
multiple frameworks (subject to the caveats mentioned above)* 

4* Use the same font size and style for each member of a set of speci- 
fiers comprising a second independent variable (i.e./ the parameter) as wa** 
used to label the a^es(this indicates that the parameter h, - che same logi w l 
status as the axes). 

5. If labels are used in a key, make sure the connection between the 
label and the appropriate specifier element is clear and unambiguous. Associ- 
ate the label with a segment of the specifier by putting the label closest to 
the appropriate segment of a superior pattern, and be sure to use a segment 
long enough so that the pattern is easily identified. 

6* Use words that are consistent with those used in the text or commonly 
used to discuss the topic. 
Drawing the Inner Framework 

After the outer framework, specifier, and labels have been placed, you sre 
now in a position to draw an inner framework. An inner framework is useful 
when absolute values are to be read from a graph, given that they allow one to 
link portions of the specifier to the appropriate labelled pips on the axes* 

1. The inner framework should not group with the specifier elements or 
the labels. This can be ensured by always drawing the inner framework with 
thinner, lighter lines thaa those used to draw the other graphic constituents* 

2. Make the grain of the inner framework appropriate for the level of 
precision necessary. A coarse grid will not be of much help if detailed 
measurements are needed, and a fine grid will only get in the way if only 
general measurements are needed. 

3* Every fifth line of the inner framework (if a grid is used) should be 
slightly heavier, which will help the reader to track along any single line. 
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4* The ends of the lines of the inner framework should intersect the 
outer framework at one and only one place, and this place should be easily 
seen* This will ensure that the inner framework hooks up clearly to the outer 
framework, so that it maps specific labelled points on one part of the outer 
framework to specific points (preferably ^ ±nts that are perceptually isolable) 
on the specifier elements* 
Drawing the Background 

First, we recommend avoiding backgrounds if there is the slightest chance 
that they will impair comprehension* if you insist on drawing one/ draw it 
last/ because you want to make sure that it does not interfere with the 
information-conveying parts of the display* If the background is sufficiently 
dim or sketchy, it can be drawn first, but you then run some risk of having to 
re-draw parts of it* 

1* Make sure that the background is not too visually dominant* It should 
be visibly less salient than any other part of the display* 

2* Make sure that the background does not draw attention from the display 
because of its complexity or becaus* parts of it seem to group with parts of 
the display* if die background is sufficiently dim relative to the display, 
or of a different degree of fineness of detail/ this problem can be avoided* 
Every element in the background should obviously belong to the background and 
not to the display* 

3* if background figures are used, they should convey a message consis- 
tent with the point of the display* 
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CHAPTER 8: EXTENSIONS AND GENERALIZATIONS 

We have so far concentrated on how one analyzes and constructs charts and 
graphs* But at the beginning of this book we claimed that this focus was 
largely for .odological reasons, and that the results of our enterprise 
would in -he end have considerably more applicability* That is, we claimed 
that charts and graphs had the virtue of being highly constrained, and yet of 
having a wide variety of different possible types'* Thus, we expected that we 
would be led to develop a set of principles and techniques that could be 
generalized to other kinds of visual displays simply by modifying some of the 
requirements for charts and graphs proper* Let us first consider how we would 
extend our approach to the other display types noted in Chapter 2, and then 
consider a much broader extension of the current project* 

Generalizing to other types of displays 

The key to generalizing to other types oi visual displays is to realize 
that the systera we have developed does not hinge on the precise nature of the 
graphic constituents* We hoped to illustrate this by using both charts and 
graphs, in which the frameworks and specifiers have very different forms* The 
entire system requires only that there be a way of dividing a display into 
parts that specify different kinds of entities* Once this is done, one can 
proceed to describe these entities and the relations among them at the level of 
syntax, semantics, and pragmatics* All of the syntactic principles are appli- 
cable to any visual display* That is, the designer of any kind of display must 
ensure that the marks will be discriminable, must take into account possible 
distortions introduced by the visual system, must be aware of how marks are 
grouped together by the visual system, and must take into account the effects 
of processing priorities and limitations* The same is true for the formal 
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mapping principles* In -ny display, one must ensure that every oiark has only 
one interpretation and that all necessary marks are present, and one must 
ensure that the inter-relations among the marks themselves are clear* 

In contrast, the semantic and pragmatic principles developed thus far are 
not directly applicable to all kinds display* This does not present severe 
problems for the semantic principles, however; if a semantic principle dis- 
cussed thus far is relevant at all, it is applicable as it stands* If a 
principle is not relevant, it simply should be ignored* some principles will 
not be relevant when displays do not involve symbolic representation (i*e*, 
where lines represent via a convention, not via depiction)* For example, in 
many diagrams the display represents solely by depicting an object oy part* In 
this case, the only relevant principles ar* those that pertain to depictions 
proper (i*e*, representativeness, concept availability) * The same is true for 
the Pragoiatic principle of "contextual compatibility"* If a display is em- 
bedded in a context, it must use terminology consistent with that used in the 
context and it should neither "tell more or less" than is required in that 
context* However, the situation is not so simple with the other pragmatic 
principle, "invited inference"* The general idea, that stimulus properties 
may invite an inference, is applicable for all kinds of visual displays* But 
precisely how one accomplishes this will vary depending on the kind of display* 
Not only are some of the principles we describe in Chapter 4 not relevant for 
many displays (e*g», those not containing axes), but other principles which we 
have not developed will be relevant* For example, with maps, use of different 
numbers of elevation rings can convey the impression that a hill is in fact 
steeper or shallower than it is* We have not begun to work out the principles 
of invited inference that are relevant for each type of display, but arc con- 
fident that this can be done* 
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Let us now consider in more detail how vo extend the analytic and genera- 
tive scheme developed for charts and graphs to other types of displays* In so 
doing we must delineate the basic-level graphic constituents for each type, and 
must note which principles are apt to be irrelevant, and we must propose new 
principles that might be relevant to specific special cases* 

Maps ; Let us first consider simple "pure" maps, which contain a depiction 
of a territory with associated -labels* These maps can be composed of a single 
unit (e*g*, a map of a state) or can be composed of a number of sub-maps (e*g*/ 
a map of the U*S* showing state boundaries)* if the map is divided into sub- 
units/ these units are the "basic level constituents 1 * of the analysis* If line 
widths vary such that relatively small units are nested within areas demarcated 
by heavier lines, then the largest unit with heavy lines (which is not the 
entire area) are to be treated as the basic-level unit* (Recall that the 
basic-level is that which is as general as possible while still having consti- 
tuent members that are as similar as possible)* Ihe relations among the con- 
stituents are simple contiguity: regions that abut are organized as represen- 
ting territories that abut in just that way* 

Pure maps of the sort considered above are a rarity/ occuring only in 
special contexts (e*g*, globes of tlie world)* Most maps are designed with the 
intent of conveying specific information about a territory* Road maps tell one 
about highways, rainfall maps about rainfall/ census maps about population/ and 
so on* These maps use conventional symbols as specifiers/ relating regions in 
specific ways* A line is taken to be a road (with a wide yellow one as a 
superhighway, a narrow red one as a backroad, etc*), a region of dark blue to 
be one in which rain falls over 300 in* a year, a region of white to be one in 
which less than 4 in* of rain "falls per year/ etc* These maps use the depic- 
tive component — the territory — as a framework and the lines, regions and so on 
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as specifiers conveying information about specific relations among features of 
parts of the territory* 

Now let as consider slightly more complex maps, in which a visual table is 

i 

superimposed over the map* In this car«s, there might be a spike over each 
location, with taller spikes represent Fi g greater populations* Or a circle 
might be drawn, with its area representing the average yearly rainfall at that 
location* In this case, the map is serving to label the elements of the visual 
table (by providing the location- which is relevant for that information), and 
the magnitude of the spike, circle or whatever is interpreted visually as 
indicating the relative amount of whatever variable is represented* 

In the rare cases in which a map represents solely by depiction, the 
semantic principles concerned with proper pairing of a symbol and a concept are 
irrelevant* when color, texture, or some other visual property is added to the 
map to convey information symbolically, either as a specifier or a label, then 
all of the semantic principles developed previously are relevant* Similarly, 
* when a visual table is imposed over a. map, now all of the semantic principles 

are relevant* In this case, the formal principles are relevant not only to the 
map, but to its relation to the table* 

The pragmatic principles one might want to develop for maps would depend 
on the specific kind of map being considered* A topological map can be modi-* 
fied to emphasize or de-emphasize height differences (e*g*, by spacing of 
^ rings); a road map could emphasize or de-emphasize congestion (by varing the 

size of the marks used to represent reads); a map of population density could 
emphasize or de-emphasize the unevenness of distribution Cby varing the size of 
^ the region in which population was averaged over), and so on* It is impossible 

to work out all such principles beforehand, but the over-riding idea is the 
same as for charts and graphs t be aware of the distorting effects of the way 
^ we describe appearances (as big, little, etc*), and strive to avoid them when 

they have the potential to mislead* 
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• Diagrams : Diagrams are schematic pictures of objects or entities* A 
diagram of a machine, fcr example, represents purely oy depiction* A diagram 
of wind patterns, in contrast, represents symbolically* Many diagrams include 

^ both a depictive and symbolic component, as occurs in "exploded view" diagrams 

in which parts of an object are separated an d connected by arrows (the pict 
of parts are depictions, the arrows are symbols)* The components of diagrams 

♦ are determined in two ways: first, the actual components of the represented 
object in part determine how one should analyze the diagram* A portion of the 
diagram corresponding to a distinct component of the machine will be analyzed 

# as a constituent unit* Second, heavy lines or other perceptual factors (e*g*, 
color differences) may also serve to define a part as a separate ur It; even in 
this case, however, the unit so defined often will correspond to an actual part 

# of the object itself* The relations among constituents will again be ones of 
spatial contiguity and of functional contingency (how one part can affect 
another)* In many cases, the diagram will not have a distinct specifier; it 

9 will merely depict the entities of interest* in others, howevor, one part may 

be of particular interest vis-a-vis how it functionally relates ("pairs", to 
use the term introduced in Chapter 2) two other parts (serving as parts of the 

# framework)* For example, a diagram may be intended to show how a given kind of 
crankshaft fits in an engine* Now, the crankshaft serves as a specifier, and 
what is important is how it relates to the other components of the engine* Or, 

• in the case of the exploded diagram, the parts are related together by arrows, 
which serve as specifiers* 

Diagrams behave almost exactly like maps in how they represent informa- 

• tion, except that they depict some object or entity rather than a territory* 
Thus, parts can be labeled by words or visual properties, and a visual table 
can be superimposed over a diagram (e*g*, using different colors to show the 

0 temperature at each point of an engine)* Thus, the comments offered about the 
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applicability of the semantic and pragmatic principles are equally appropriate 
here* 

Visual ta bles : A visual table is like a numerical table/ except that 
values are l ^resented by visual *operties of symbols or depictions* For 
example* increased amounts of oil could be represented by larger pictures of 
oil barrels, by bigger blotches, or by darker swatches of gray* The consti- 
tuents here are the specifiers and labels; if there is a framework/ it serves 
merely as a way of labeling the specifiers* In contrast to graphs/ the meaning 
of the specifiers does not derive from mapping parts of the framework to other 
parts of the framework* Thus, an analysis of a visual table involves isolating 
the individual specifiers, ani ensuring that they are properly identified 
(either via recognizing a depiction or associating a label) and that they are 
properly interpreted (<**g*/ with bigger shapes representing more of some 
quantity)* m these cases/ the horizontal formal mapping principle may not be 
applicable if labels are directly associated with each specifier; if labels 
must be extracted via i. key or via a framework, then this principle is appli- 
cable* Other than this, all of the syntactic, formal and semantic principles 
described in this book are applicable to these displays* Again, however, the 
pragmatic principles are less clearly related* To the extent that simple size 
represents quantity/ however, then all of the principles of invited inference 
developed in Chapter 4 that affect apparent size (e*g*/ varying irrelevant 
integral dimensions) will be applicable here* 

As should be clear from even this brief treatment, the core of the system 
we have developed in this book is easily generalizable to other forms of visual 
displays* All of the display types considered above are less constrained than 
a high-precision graph/ in which points along a specifier must relate together 
a specific pair of points* one lying on each axis. The principles we needed to 
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consider when constructing such displays encompass those we need to construct 
good maps, diagrams and visual tables* The principles that dictate how to 
emphasize a particular point, however, depend in large part on the point itself 
and the way a given display works* 

II* Species of Visual Pisplays 

The project described in this book is an example of how a body of facts, 
concepts, and theories developed in "pure" research can be brought to bear in 
the service of dealing with an "applied" problem* One of the reasons it is 
interesting to engage in this kind of exercise, in developing a technology from 
a science, is that ^n the course of developing the technology one often ends up 
inspiring new "pure" science* This is true in the physical sciences, and it 
should not be surprising that it is true here* Thus, in this last section of 
the book we would like to show how this project on charts and graphs feeds into 
a more general domain, the study of visual representation as a whole* 

Let us begin by considering different types of visual displays, using a 
more general taxonomy than the one just considered* In this taxonomy, we will 
consider three broad types of uses to which displays can be put* Further, we 
will use a more general taxonomy than one dividing displays into charts, 
graphs, maps, diagrams and visual tables? as should have been clear from the 
foregoing discussion, some of these display types are almost variants on a 
common form* Consider t"*e taxonomy presented in Table 8*1* The columns of the 
table correspond to different types of displays* The first type are "intrinsic 
configurations", where the lines do not refer to anything else* A diagram used 
in geometry is of this type, as is a purely decorative pattern* The second 
type are "models", where the lines refer to something else, serving to portray 
that which is referred to* A'drawing of an object, scene, layout, or a map is 
of this type* The third type are "symbolic" representations, where the lines 
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refer to something else, but that something else is not actually shown (usually 
because it is an abstract idea or state)* Charts, graphs, and abstract "nota- 
tions" (e.g., Venn diagrams) are of this type, intermediate cases are of 
course possible, but these are formed when elements of a display fall into 
different categories, one example of this is an "^ploded diagram' 1 , in which 
pieces of a device are drawn separately with arrows indicating how the pieces 
fit together; the pieces are models, the arrows are symbols. 

INSERT TAELE 8.1 HERE 
Now* let the rows of the table correspond to different uses of displays. 
The top row contains displays that are used merely to illustrate or present 
information, h drawing used to illustrate a rhombus, to show the layout of a 
house, or to indicate rising prices by a rising line in a bracket (i.e., a 
graph) fall into the three columns, being examples of intrinsic configurations, 
models, and symbolizations, respectively. The next row corresponds to displays 
that are used to help one solve a problem. In this case one does more than 
simply extract information from a display; one u*es the display to help one 
reason through to a solution to the problem represented in the display, h 
diagram used to prove a theorem in geometry, a picture of pulleys used to 
anticipate what will happen when one pulls the rope a certain distance, and 
Venn diagrams used to solve 'logic problems fall into the three columns along 
this row. Finally, the last row contains displays that are generated when one 
is trying to discover the best way to formulate a problem in the first place. 
In this case one often may generate numerous different displays, considering 
the implications of each, before making o.»e that seems to provide insight into 
how to look at a problem. Presumably some of the diagrams Euclid drew belong 
in the first column, some of the images Einstein reported definitely belong in 
the second column (e.g., of himself riding on a beam of light, when he first 
began to ponder relativity), and scribblings created by untold numbers of 
mathematicians belong in the third column in this row. 
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The foregoing taxonomy Is interesting in part because it defines a hier- 
archy of principles for visual display design* jn the first row we have the 
syntactic principles that dictate how marks will be organized, encoded, and 
retained J * active memory* For example, marks near each other will tend to be 
grouped t^ether, marks that are drawn with heavier lines will tend to be 
noticed sooner, and too many perceptual units will be difficult to apprehend 
(see Chapter 3). These sorts of principles apply to all of the cells in this 
row* In addition, in the secoriStwo cells we have semantic principles that 
dictate how patterns of marks will be interpreted as conveying meaning (intrin- 
sic configurations do not refer to anything else, and thus are not interpreted 
semantically) * The semantic principles that are appropriate for models are 
straightforward; they deal with the way pictures are seen as resembling 
objects* The external and internal mapping principles are applicable here, as 
are the principles of representativeness and concept availability* The right- 
most cell in the first row inherits not only the syntactic principles that are 
relevant to une first cell, and the .semantic princples that are relevant to the 
second, but also adds yet another layer of semantic principles to these princi- 
ples* The semantic principles that apply only to symbolic displays are more 
complex, focusing on how variations in marks (e*g*, size, color, texture) map 
into conceptual dimensions (see Chapter 4)* For example, bigger marks will be 
interpreted as representing *ir.ore" of some thing* 

INSERT FIGURES 8*1 AND 8*2 HERE 

The principles assigned to each cell in the first row are inherited by the 
corresponding cells in both rows beneath them* That is, these principles are 
equally valid for diagrams used merely to convey information, used to solve a 
problem, or used to help formulate a problem* in addition, in the second row 
we add another set of principles* These displays are not simply read, but are 
actively processed in the course of using them bo solve problems* Thus, we 
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can add a second set of principles here, which specify how the various displays 
can be processed and the best ways of processing them* Imagery would appear to 
be a key means by which these sorts of displays are used* Moving across the 
columns: Imagery is often reported when subjects try to solve geometry 
t /lems, such as proving that two regions of a diagram have the sane area* In 
this case, parts may be imaged and rotated, shifted across the page, expanded 
or reduced in size, and compared to other parts* For example, consider Figure 
8*1* Does the inner square have^ half the area of* the outer one? One way to 
solve this problem is to fold the corners of the outer square so that the tips 
meet in the center, and to "see" in the image that in so doing, one neatly just 
covers the inner square* Kosslyn (1980) specifies the principles that 
constrain how such images operations can proceed* The principles of imagery 
processing that are relevant here generalize to the other two cells* Moving to 
the middle cell, imagery is often used when one anticipates how a model would 
look when in motion* In this case, imagery is used to conduct a kind of 
"simulation" on the diagram, with the -aim of mimicking the corresponding actual 
event (such as by imaging how gears will interact when the first one in a 
series is twisted clockwise)* r*or example, consider Figure 8*2* If the left- 
most gear is twisted clockwise, which way will the rightmost gear move? 
Finally, in the last cell in* this row, imagery is used to manipulate symbols* 
For example, Venn diagrams are sometimes reported to be "seen" to slip in and 
out of one another and swinging about in various ways when one is trying to 
discover if a certain conclusion follows from a set of premises* In all three 
cells of this row, then, we not only have the principles inherited from the 
first row, but we have additional principles that dictate how displays should 
best processed to achieve certain ends* It seems safe to say that we have 
just begun to make progress in discovering and formulating these principles. 



One of the consequences of considering together the two sets of princi- 
ples, those derived from studying charts and graphs and those derived from the 
study of mental imagery, is that they will interact* That is, in designing a 
diagram to be used as an aid to solving problem- one must consider not only 
factors tii it pertain no the diagram itself, but also factors pertaining to how 
easily imagery can be used in conjunction with the diagram* For example, by 
varying the line weights some parts of a diagram could be emphasized over 
others, or some organizations of- the figure made more salient than others* 
Depending on what the illustrator wants the viewer to do with a diagram when 
using it as an aid to solving problems, different stimulus factors can be 
varied bo encourage different imagery manipulations* For example, in Figure 
8*1, if the small triangles that are to be imaged folding are emphasized with 
heavier lines, this might encourage people to attempt to image those parts 
being manipulated in different ways* This conjecture could be studied 
directly, and in fact the entire realm of diagram design fo~ problem solving is 
ripe for study* fc 

Finally, the bottom row of the table inherits all of the principles that 
pertain to the previous two uses of displays* But now we must add another set 
of principles that are specific to these kinds of displays , namely those that 
pertain to how displays should be created bo help formulate a problem* These 
principles will be intimately tied up with principles of creative thinking in 
general, and remain a mystery at the present writing* 

Thus it is clear that the attempt to use the available data, concepts and 
theories in an applied setting has not been of mere technological interest* 
The principles and theories we have developed can easily be used as building 
blocks in more general projects addressing broader issues of representation and 
of use of visual information* ■ 
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Ill * Conclusions 

In this book we have attempted to accomplish three things; First, speci- 
fically we have tried to discover what makes a good (or bad) chart or graph* 
This goal has resulted in an analytic scheme which one caw use to diagnose the 
problems with a given chart or graph and a set of guidelines to help one c 
struct good charts and graphs in the first place* Second, we have tried to 
develop a general conception of what is going on in the head of a reader when 
he or she is extracting inforuiation from charts and graphs* This theory was 
useful in part in its role of providing heuristic guidelines for the construc- 
tion of good charts and graphs* Finally, we have tried to shoW how this parti- 
cular project is just the tip of an icebergi not only can our specific guide- 
lines be generalized to other kinds of visual displays, but the theoretical 
framework we have developed can serve as the foundation for scientific work in 
more general problems of visual representation* We are painfully aware of the 
deficiencies of our accomplishments on all three counts, but are encouraged by 
how easily our accomplishments were achieved and how clearly the issues and 
questions have presented themselves* We hope that this book' provides both 
practical tools for illustrators, and inspiration to other researchers to con- 
tinue to demonstrate that scientific approaches to psychology have much to 
offer society at large* 
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Why so lute? 



TABLE, Il-lt- 



Domain Scale Range Scale Example 

Nominal Nominal States w/ and w/out 

Capital Punishment 

Nominal Ordinal States by rank in 

coal production 

Nominal Interval Students by score 

on acheivement test 

Nominal Ratio State* by coal 

production 

Ordinal Ordinal Ranked oil production "by 

ranked coal production. 

i 

Ordinal Interval Rank in classes'by acheive 

ment test score 

Ordinal Ratio Ranked coal production 

by oil production 

Interval Interval . Math acheivement score by 

english acheivement score 



Interval 
Ratio 



Ratio 
Ratio 



Acheivement scores by 
hours/week of TV watching 



Coal production by oil 
production 



E xamples of Information Available 

Frequencies in cells; nan^d things sorted 
into classes on all/none basis. 

Allow N(N-l)/2 inequalities statement to 
be made. 



Map N things into an infinite amount of 
classes. Camparispn of t inferences. 

Nonarbitrary zero point* Ratio comparison 
of items possible. 

Relative ranks* Comparison of disparties 
in ranks* 

Numerical assignment of the re* "ive posi- 
tion of some characteristic of \ item* 

Relative difference or ratios of oil for 
different ranks* 



Differences on both dimensions* Difference 
in one dimension as a function of difference 
in other. 



Mapping into nonarbitrary zero point scale 
specified relationships* 

Absolute amounts* differences in amounts, 
ratios, for both dimensions va^ie of one 
dimension as a function of o' /r 
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• REORGANIZATION 



• CAPACITY LIMITS 



KNOWLEDGE 



SENSORY INFORMATION 
STORE 
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. SHORT-TERM MEMORY 




/ 

LONG-TERM MEMORY / 

/ 
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• DISCRIMINABILITY 

• DISTORTION 

• ORGANIZATION 

• PRIORITIES 



• DESCRIPTION 

COMPATIBILITY 
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Figure 1: Examples of the Three Basic- level Constituents 

of a Chart or Graph 
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AVERAGE STOPPING DISTANCE 



MILES 

PER 

HOUR 



1^ 



SSFHAVERAGE 



AVERAGE 

BRAKING 

DISTANCE 
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20 



30 



40 



50 



60 



FEET 50 



TOTAL 
DISTANCE 




NUTR. i iONAL INFORMATION PER SERVING 



Nutritional cont. i 

wtA o o'o o 

vit. B1 c O O O 
vit. B2 O O O O 
vit. C 
ca!c. 



iron 
prot. 



o o 
o o o o 

©000 

,0000, 

I v 

needed per day 



g ERJC 
!_ 




SERVING SIZE ONE CUP 

SERVINGS PER 

CONTAINER 8 



protein 4g 

fat Og 
carbohy. 38g 



-> 170 kilocalorie 
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Table 1 



Summary of Cited Recommendation for 
size of Displayed Letters 



Condition 



Visual Angle 
In 

Minutes of Krc 



Normal Acuity 

(Snellen E Chart) " 

Reasonable size (of numerals) 

(Murrell, 1965, Fletcher, 1972) 

preferred size (of numerals 
(Duncan and Kons, 1976) 

MIL-STD-1472B (1974) 

General Labels, good viewing 

Noncritical data 

Critical data, fixed position 
High Luntinance 

Low Luminance 



10 



23 



16+ 
6-24 

12-25 
19-37 



[■•5 



se 
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Table 2 

CONVERSION FACTORS FOR LUMINANCE UNITS 



Foot* * ' Mflli* Candle* Candid Candles 

Unit* JaosberU L&mberU lamberU per aouare per square per squa** 

Inch > foot centimeter 

fL-L- 1*076 X 10-* 1-076 . 2.21 X 10"* 3.18 X 10"* 3.43 X 1CT< 

L.,^ * ^ X 10* _ 1.0 X JO* 2.051 2,£6 X 10* 3.18 X KT* 

mL X 10"* 1.0 X 10-* 2.054 X 10"* 2.957 X 10"* 3.3*3 X 10* 

c/Ift*. 4.52 X 10» 4. 87 X 10"* 4.S7 X 10* - 1.44X 10» 1,55 X 10-* 

c/ft* 3.14 3.38 X 10-* 3.38 tM X KT* * * 1.075 X 10-* 

c/cm* X 10* 3.14 3.14 X 10* t.4$ *.29 X 10» + ^„ 

Kt>ie: Value in uiOtainle/t-liand column tirot* ooDvenfon factor equal* vijutlo uruU*kowa at top of column. 
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Table 3,1 

liOO mO CONE VISIOH OF THE HUMAN EYE 



Cone 



Rod 



Distribution .(ca* 7 million) 



Jtetmjl location 
Ncuf proceiung 
Peak wavelength 
tumifllfiCO teuet 
Color vilton 
Qirk irijptltiOn 
Split it resolution 



Concentrated i\ center. feww in periphery 

DiKrrmirutivC 

SSS nm 

Daylight (t to tO T ml) 
Normally tnchromjrjc 
Ripid (ci. 7 min) 
High acuity 



TempotJl revolution Fi*f reacting 



(ca. 120 million) 

Geneuf in periphery, none in love* 

Summjtivc 

SOS nm 

Night (10-* to t ml) 
Achromatic 
Slow (ci. 40 minj 
tow icutty 
Slower reacting 



:RLC 



60 
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Table 



Differential Sensitivities for Retinal Variables 



fetlnal 
Variable 



Dimension 



Differential 

Sensitivity {%} 



Method 



Stimulus 
Range 



S tandard 



Viewing 
Conditions 



Wo* of 

Subjects 



Reference 



Size 



line length 



Area 



tfumerosl ty 



*.l 



6,0 



20.4 



limits 



Constant 
Stimuli 



3.4-6.$ cm 
8.4-11.6 
13. 4-1 6 



5-15 dots 
13-25 
26*50 
70-100 
120-180 



5 cm. 
10 i 
15 



15 dots 

25 

50 
100 
180 



non simultaneous 60 



nonslmul taneous 



Ouo 
(196?) 



Bal rri 
U909) 



Tavos 
(1941 



Color 
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Hue 



Saturation 



Brightness 



2*0 



1,4 



Constant 
Stimuli 



Constant 
Stimuli 



Adjustment 



410-500 m\i 
510-630 



simultaneous 



25% purity 

35 

30 

65 

80 

0.62r?24.f> ml 
61 



simultaneous 



simultaneous 



Slonol and Dlmmj 
(1902) 

Slegol (1964) 



Panack tind Stevens 
Imlou and Stevens 



tovry 



tcont.) /* 
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Table 4 tcont,) 



Retinal Dimension Differential Method Stimulus Standard Viewing no, of 

Variable Sens! tivity (\) Range Conditions subjects Reference 



Distortion 
of 
Square 

Distortion 
of 
Diamonds 



1,37 



4,8 



Single 
Stimuli 



Absolute 
Judgments 



tt 



25 x 20 cm 
to 

25 x 30 cm 

,92S-1,07S 
(height /width 
fatio) 



2S x 2S cm nonsiir.ultaneous 



height/width 

ratio of nonslmultaneous 
1,00 



20 



Vcniar 
<19K» 



Kvlly mid Bliss 
U971) 



* J WD calculation differed from normal convention 

+ Tor the Paneck and Stevens experiment 

' Seo Figure %\ 

■* Standard varied in 10 mu steps along total stimulus ran<je 

l Kor a description of this method* see woodwertb (1938) 

t Similar to constant stimuli 




< 

Table £ 

NINE EQUALLY - DISCRIMINABLE SURFACE COLORS 



MurtseH 

Code book Excitation Dominant 
Huee number number purity wavelength 



1 
2 
3 
4 

ft 

7- 



1.5 


3R 


37,2 


3 


9R 


es.8 


5,5 


AYR 


81,8 


8,5 


1GY 


76,0 


U-5 


3G 


27,5 


15 


7BG 


35,0 


IS 


OB 


56,5 


20,5 


9PB 


52.7 


24 


3RP 


36,5 



629 
596 
582 
571 

m 

491 
481 



8 WB 52.7 460 

* 24 3RP 3fi_A 5io 
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TABLE 

POWER LAW EXPONENTS FOR LIKE LENGTH 



Ho* Exponent Variability* 
Measure 



Method 



Stimulus Range 
(cm) 



Standard 
(cm) 



Location of 
Std 1" Ramje 



Modulus Ho, of 
Subjects 



Reference 



1. 



1,0? 



.78 



magnitude estimation 2*1-15*9 
magnitude estimation *3-17«7 



Bjorkman* Stranger 
( I960) 

8Jorkman* Stranger 
(1960) 



CO 



5* 



6* 



7. 



9, 



1.11 

1.00 

.98 

1.02 

1.07 
.93 



ratio estimation 



1.3-2.75 



W.An 



Hi A. 



Oxy v *05 

Oxy - .03 

0 * .10 
Ox*y * *00010 



.97 Ox*y * .00008 



magnitude production 1.3-254.0 



magnitude estimation 1*3*83*8 
(based on apparent Length) 

magnitude estimation 1*3-83*8 
(based an physical length) 

magnitude estimation' 2*4*9*3 

magnitude, estimation 2*0-10*2 

(o£ circle diameters) : * 

magnitude estimation 2*0-10*2 
(of vertical Lines) 



13*5 

8.9 

8.9 

■ 7.7 
10.2 

10.2 



LOW 
LOW 
LOW 

High 
High 

High 



11. A. 



10 



10 



10 



10 



, 'not 
assigned 



not 
assigned 



10 Ekman* JUnge 

(1961) 

10 Stevens* Guirao 

(1963) 

10 Teghtsooni: 
(1965) 

10 Tegbtsoonian 
(1965) 

36 Rule (1966) 

40 Stanley 
<1967) 

40 Stanley 
(1967) 



T A modification of the method of constant sums 

* Hie different measures of variability used by investigators aret oxy - sample of std dev from regression (log-log plot)* o - std dev for distri- 
bution of individual subject exponents* Ox^y - residual variance about regression (Lo9-Log plot)* R - range of individual subject exponents* Rv - 
width of range of indivifliiAl subject exponents 

*• ti*A* * parameter is Hot Applicable 
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TABLE 0 * 
(Continued) 
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m 








POWER LAW EJCPOHEHTS FOR tIMC LENGTH 












Ho. 


£>cponent 


Variability* 
Ho A sure 


Method 


Stimulus Range 
tern) 


Standard 

<C*> 


toe At ion of 
St J in flange 


fcodulus 




No. Of 
Subjects 


" fteferencu 


10. 


1.M fc 


Rw » .07 


magnitude estimation 


20.3-M35.4 


30.4 


Lou 


12 


18 


Miller, 


SheL 


11. 


t.OO* 


Rw - .24 


magnitude estimation 


20.3-195.4 


91*4 


Middle 


36 


18 


Miller, 


Sheldon 

t 1vo9l 


12. 


t.04 t 


Aw » .35 


magnitude estimation 


20.^-185.4 


t52.4 


Hi4h 


60 


18 


Killer, 


Sheldon 


13. 


.94 


R « .73* 1.39 


magnitude estimation/ 


1.27-20.32 


none 


H.A.** 


none 




24 


Ouda t 19751 



Bach stimulus was a group of six parallel horizontal linear lengths within a stimulus group were uniformly distribute) with a range of 30 cm* 

Subject's estimated average length for group. * 
* The different measures of vendibility used by investigators are* owy sample* std dev from regression Clog-log plot), o - std dev for distribution 

of individual subject exponents; Ox 2 y - residual variance about regression (log-log ploth R - range of individual subject exponents, nw - width of 

r<*nge of individual sub3ect exponents * , J 

** H< A. - parameter is Nbt Applicable 



3. 



335 



TABUS ^ 

POWER LAW EXPONENTS FOR AREA OF yARlOUS FIGURES 



No. Exponent Va rlabil 1 ty* 
Measure 



Method Stimulus Ratio / Standard 

i * Wax Aroa/tUn Area Std Area/ 
Min Area 



Location of Modulus MO * of 
St<j In Range s.t*>ji!Cta 



Reference 



Circles 
1* 
2* 



*06 
*96 

r*20 



tt 



ratio setting 
ratio estimation** 



ratio estimation 



tt 



9*0 , W»A* 

7.0 N.A* 



26.6 



N*A* 



N *A* 
N*A* 

N*A* 



N*A* 
>VA* 

N*A* 



EKman 

tJjorktfoin, Strange 
( 1960 ) 



Djorir 1 in , Strungu 
' *>60) 



4* 



*7e 



ratio estimation 



tt 



N*A* 



W*A* 



N*A* 



ftjorkmartr Strange 

( mo ) 



.ao c 



magnitude estimation 



2*30 



Middle 100 11 Efcman, LUtlirum, 

W> lU*iiA-GlbM>n 
( Ht.1) 



*9S 



magnitude estimation 



2*1 



Low 



10 



Kknunr Juiujc 



7. 



1*0S 



magnitude estimation 



4*S 



1*0 



Low 



10 



{ W6 1 ) 



8* 



*99 



magnitude estimation 



9*S 



» no 



LOW 



10 



Ekman, Junge 



Squares and circles as stimuli (data was pooled) 

rttitlo setting Is a modification of fractionation* Ratio estimation is a Modification of tlu* nyjfchod of constant flum* 



tt 



* The different measures of variability used by investigators «rei Oxy - sample std dev from regression H<wj-lo«j plotJ, i* * «t»l ,l*v for diittihu- 
tton of indivic1n.il ;irhj<'ct i>xjv>nents, - residual variance about regression, llog-loy plot), ri - range of individual smIj^ ( t ''«i^ii' v «t'i, *v - 



"*'Uh of range of t'ldtvtdiial subject exponents 



ERIC 



V< - par^uvrter ts fto*. Applicable 
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TABLE 7 
(Continued ) 

POWER LAW EXPOHENTS FOR AREA OP VARIOUS FIGURES 



No. Exponent Variability* 
Measure 



Methcd 



Stimulus Ratio Standard location of 
Max Area/Win Area Std Area/ Std in Range 
Hln Area 



Modulus 



Wo. of 
Subjects 



Reference 



Circles 

9. 1,03 



10. 



12. 



13. 



14. 



15, 



16. 



.76 



11, 1.03 



.70 



.69 



.91 



.59 



.55 



ayx » .06 magnitude estimation; 91 

(physical area) 

ayx * .05 magnitude estimation 91 

(apparent size) 

<J* .23 magnitude estimation 210 



magnitude estimation 12 T 



magnitude estimation 1000 



magnitude estimation 4.7 



P> .99 magnitude estimation 179 

(apparent size) 

p> *gg magnitude estimation 3015 

(apparent size) 



25 



25 



51 



11.0, 17.4 

always present 

49.0, 75.5 



Middle 
Middle 
Middle 



tt.A. 



tt.A. 



tow 



Low 



10 



10 



10 



«.A. 



tt.A. 



10/100 



10,100 



10 gr*d 
students 

TO grad 



undergrads 



Teght soon lan 
(ig65) 

Teght soon i an 
( 1965) 

Rule (ig66) 



Manhours Hoeman 
( ig69 ) 

M» £ R* Teight- 
soonian ( 1g71 ) 

Vogelf Telght- 
soonian Mg72) 



■ru-Millaru 
V \ *t et al* 

1974 
experiment 1 



tt 



Ratio setting is a modification of fractionation. Ratio estimation is a notification of the f*ethod Of constant Rum* 



• the different measures nf variability ujigd by investigators are* Oxy - sample std dev from regression (log-log plot)/ <J ~ std dsv for distribu- 
ter nf individual subject exponents / a xy - residual variance about 



ERIC 



regression (log-log plot), R - range of individual subject exponents, Rw - 
4th of rangf? of imH-fKtual subject exponents* f> - correlation betwnen log ■!* & loji[» 
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TABLE 7 
(Continued) 

POWER LAW EXPONENTS FDR AREA Of VARIOUS FIGURES 



Ho* 



Exponent 



Circles 



17* 



15* 



19* 



20* 



2f* 



22* 



23* 



V«iri.ihility* 

MiM^urt* 



Method 



*92 



*ai 



*59 



*66 



*S4 



Stimulus Ratio standard Location of 
Kax Area/Min Area Std Area/ Std in Range 
Kin Area * 



Modulus 



Ho* of 
Subjects 



Reference 



ii > *S9 maynitrt .ion 179 

(physical area) 



p > *99 magnitude production ' 3075 

{physical area) 

l> > *99 magnitude estimation 

(apparent size) 

0 > *99 magnitude ji r<>1 Motion 3075 

(apparent sizei 

i* > *99 magnitude estimation 179 

(physical si&e) 

f> > *99 magnitude production 3075 

(physical si*e* 

p > *99 magnitude estimation 85 

(physical area) 



H*0 or 17*4 
always present 



48*0 or 75*5 
always present 

11*0 or f7*4 
presented once 

48*0 or 75*5 
presented once 

11*0 or 17*4 
presented once 

48*0 or 75*5 
presented once 

1 

always present 



Low 



Low 



Low 



tow 



Low 



Low 



Low 



10 er 100 
i 
i 

10 or 1QQ 
10 or 100 
10 or 100 
10 or 100 
10 or 1M 
1 or f0 



tt 



Ratio setting is a modification of fractionation* Ratio estimation is a nodificAtion of the method of constant sums 



Mac Mil Ian* 
Koschetto ct al* 
(1974) cent* 
experiment 1 



KacHUUn, 
Koschetto et al. 
11974) cont* 
experiment 2 



* The different measures of variability used by investigate*. *rei tfxy - sample std dev from regression (log-log plot)* 0 - std dfcv fof distribu- 
Q " 'on of Individual subject exponents # Ox y - residual variance about regression (log-lo-j plot), R- range of indJ vidua 1 subject exponents * Rw 



:j^Jj^"Lt1th of ra^ge of individual subject exponents* p - correlation between lot) ^and lo<|iN 
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TA&tE 7 
(Continued) 

POWER 1AW EXPONENTS FOR AREA OP VARIOUS FIGURES 



Wo* 



exponent 



Variability* 
He a sure 



Method 



Stimulus Ratio Standard 
Max Area/Min Area Std Area/ 
Kin Aroa 



Location of Modulus No* of 
Std in Range Subjects- 



Reference 



Circles 



24 « 



25. 



26* 



27. 



26. 



.97 



.00 



.7*; 



.70 



.71 



p > *99 magnitude estimation: 

(physical area) 



p > .99 magnitude estimation 

(physical area) 

0 > *99 magnitude estimation 

(physical area) 

p > .99 magnitude estimation 

(physical area) 

p > .99 magnitude estimation 

(physical area) 



65 



65 



65 



65 



65 



. 13.6 Middle tO or 100 

always present 1 



65 

always present 
1 

presented once 

13,6 
presented once 

65 

presented once 



High 



Low 



Middle 



High 



too or 1000 
1 or 10 
to or too 
loo or tooo 



MacMil lan t 
Moscbetto et a) . 
1 1974 ) cont * 
experiment 2 



tt 



Ratio setting is a modification of fractionation. Ratio estimation Is a rrodlf lcation of the method of constant sums 



* The different measure* of variability u*ed by investigators arei o xy - sample std dev from regression (log-Log plot), std <Jev for distribu- 1 
tlon of Individual mihject exponents* ox y - residual variance about regression (log-log plot)r R- range of individual subject exponents^ mv - 
vldtn of ranqe of individual subject exponents, p correlation bQtween log <fr 6 log ip* 



ERIC 34* 



34J 

' 1 K ) 



Cx^oncnt 



Cubes 



t, KOI 



2* 



.07 



3. .72 
Octaherirones 



4* 
5, 



.74 



Varl*i1>ility+ 



TABLE fi 

POWER LAW EXPONENTS FOR VOLUME OF VARIOUS SOLIDS 



Method 4 'Stimulus Range Standard Location of Modulus No* oC 

(Max voi/ttin vol) (Std vol/ Std in Range Subjects 
Min vol) l 



oxy * .02 
Oxy * .02 

Oxy * ,04 
<Jxy • .04 



ratio estimation 9*5 

magnitude estimation 1000 
t 

magnitude estimation 145 

magnitude estimation 1060 

magnitude estimation 70 



N,A, 
78 
11.4 

46 
8 



N»A» 
mid 
mid 

mid 
mid 



M»A* 

i 

10 

to 

10 
10 



10 



10 



10 



10 

to 



Reference 



Ekman* Jung'j 

{ mu 

Tel ght soon Ian 
( 196U ) 



The different measures of variability used by investigators are* tfxy - sample Std dev from regression (log-log plot), 0~ std dev for distribu- 
tion of individual subject exponents, Ox*y - residual variance about regression (log-log plot), R - range of individual subject exponents, ru - 
width of range of individual subject exponents* 



H-A* - parameter to Hot Applicable 



9 



34; 



I 



%1 

TABLE 10- 



POWER LAM EXPONENTS FOR APPARENT SIZE OF PERSPECTIVE DRAWINGS 



Ho* 


Voluim* Exponent 
(perspective 
rtrawintju) 


Variability* 
MctSure 


Method ^Stimulus Range 
(Man Vol/Hin vol) 


Standard 
(Std v 0 l/ 
Win vol) 


Location of 
Std In Range 


Modulus 


H>* of 
Subjects 
i 


Reference 


1* 


cube 






ratio estimation 




j N#A* 


H*A* 




TO 


j 

Ekman, Jungs 

(1961) 
experiment 1 


2* 


cube 






tn*c*,i1-ude estimation 

i 


9500 


m 


mid 


1 

100* 


12 


Ekman, Junyo 

(196t> 
experiment 3 


3. 


sphere 


.74 




magnitude estimation 


9500 


100 


mid 


100 


12 


•* 




various cubes 
and spheres 


.69 


S.£« - *Q5 


magnitude estimation 


9500 


60,600 


lowf mid 


100, 1000 


186 


Ekman f Lineman, 
till liam-Qlsson 
(1961) 
experiments 1 ft 2 




cubes and 
spheres with 
*jrface texture 






magnitude estimation 


3500 


60 


low 


100 


99 


Exman, % iri'V. n, 
William* .son 
( 1961 ) 
experiment 3 



* The different measures of variability used by investigators are* Oxy - sample &td <3tev from regression (log-log plot), o - std dev for distribu- 
tion of individual subject exponent?* ok y - residual variance about regression (log-log plot}* ft - range of individual subject exponents, for - 
width of range of individual subject exponents, 

** N*A* - parameter ie Not Applicable 
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TABLE 

fOWER LAW EXPONENTS FOR PROPDRTIOtf AND KUMEROUSNESS 



Mo* Exponent 



Variability* 
Htrisure 



Method 



Stimulus Ratio Standard 



Location of Modulus Mo* of 
Std in Range Subjects 



Reference 



Pfopor t ion 
1* i97 <j - .38 

Humerousneas 
2* 1*34 



3* 
4* 

5* 

6. 

7* 



1*03 
.72 

.77 
.93 



magnitude estimation 80 elements 
(dots 6 lines) 

1 s/eo - 75/eo 



fractionation 



2-180 dot* 



40/80 



N*A. 



0 - *23 magnitude estimation 9- 82 dots 27 

magnitude estimation 25-200 dots none used 

magnitude estimation 25*200 dots none used 

magnitude estimation 25-400 X's none used 

magnitude production 25-200 X'e none used 



mid 



W.A. 

mid 
W.A. 

W.A* 

N * A. 

W.A. 



10 30 



N*A. 5 



10 
H.A. 

H.a. 

W.A. 

I*. A* 



36 
30 

32 

32 

32 



Rule (1968) 



Stevens* S.S. 
( 1957) based on 
data by Taves 
( 1°41) 

Rua*> ' ' ,56) 

Krueger { 1972 ) 
experiment 1 

Kruoger ( 1972) 
experiment 2 

Krueger (1972) 
experiment 3 

Krueger ( 1972) 
experiment 4 



tt Ratio setting is a modification of fractionation* Ratio estimation is a modification of the method of constant sums. 

* The different measures of variability used by investigators *roj axy - sample $td dev from regression {log-log ploth 0- std dev for distribu- 
tion of individual subject exponent*;* Ox y - residual variance about regression {log-log plit) t n - range of individual subject exponents* *- 
Q of rar.ge of indivitlual subject exponents, 

EI^IC* parameter is Not Applicable \ } ^ *^*f / 
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- tabled 





Exponents For 


Saturation of Sur f dee 


Colors 








Hue 


Wave Length 


- 

Luminance Factor 


Artificial 


Daylight 




tnm) 


{% reflectance) 


Light 












(69.7 db) 












** 


ft lo 
U . / v 


4° 


0 .7 ° 


Bluish purple 


425 


18.6 


1.77 








Purplish blue 


462 


"16. 0 


1.44 








Blue 


473 


19.5 


1.50 


1.97 


1.11 


1.94 


Greenish blue 


481 


17.5 


1.97 








Blue green 




18 • o 


2.00 








Green 




oft ^ 


1.97 








Yellowish green 


556 


27.5 


2.84 


09 


2.46 


2.86 


Greenish yellow 


573 


47.9 


4.06 








Yellow 


577 


53.4 


3.58 


2.85 


4.01 


2.84 


Orance 


588 


17.6 


2.60 


2.96 


2.74 


3.01 


Orange pink 


604 


13.5 


2. 17 








Pink 


o14 


15.1 


2.26 








Pinkish red 


630 


21.5 


2.24 


1.73 


• 1.60 


1.84 


Purplish pinH 


499 


18.3 * 


2.39 








Reddish purple 


562 


19.7 


1.96- 









Prom Guirao S de Mattiello (197^ 



ERLC 



Table^/7^ 

Exponents For Lightness of Surface Colors 



Hue 


Colormetric 
Purity 


Ran<ge 
Luminance 
Factor (%) 


Exponent 
Daylight Artificial 
Light 


Gray 


- 


.4-80 


1.07 


.92 


Blue 


*03 


5-59 


.84 


.67 


(470 run) 












.05 


- 4-40 


.73 


.77 




.07 


e-25 


1.03 


1. IS 




.09 


7-30 


* oo 


• Oil 




.16 


7-19 


.78 


.65 


Green 


♦ 23 


9-63 


.90 


.SO 


(553 run) 












.26 


9*61 


1.04 


1.03 




.34 


16-41 


.92 


.76 




.39 


13-47 


• 25 


• 69 




.43 


16-33 


.73 


.70 


Yellow 


.36 


9-68 


1.99 


.48 


(574 nm) 












.47 


8-38 


1.02 


.72 




.57 


19-70 




* /o 




.69 


20-72 


, 1.90 


.52 




* 

* 76 


iCJ— /D 


.94 


.50 


Red 


.07 


17-43 


1.12 


.92 


(622 nm) 












.11 


6-55 


.88 ' 


,86 




.16 


6-4 1 


.90 


.90 




*23* 


7-28 


1.04 


.96 




.51 


3-17 


.68 


.62 



From de Mattiello Guirao (1974) 



Figure '3 



LUMINANCE LEVELS FOR A NUMBER OF COMMONLY 
EXPERIENCED CONDITIONS 
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Figure 



VISUAL ACUITY AS A FUNCTION OF BACKGROUND LUMINANCE 
(from Moon and Spen.az;, 1944) 
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Figure & 
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Figure ^ 

GRAPH SHOWING THEORETICAL (solid lines) Pm EMPIRICAL 
(dashed line) FO?M OF WEBER* s LAW 
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Figure $b 



EXPENDITURES 
(Greater than one JWD) 
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Figure lib 

TECHNOLOGY MANPOWER FOR DIFFERENT REGIONS OF U.S. 
(Greater than one JHP 




Figure t2^ 
RELATION OF HUE TO WAVELENGTH 
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MEAN JND's AND STANDARD DEVIATIONS FROM 410-63*. 
(from Siegel, 1964) 
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Figure 1. Reductions of two scatterplois used in the three types of experiments. The left p, nel 
is point'Cfoud size 2 and the ripht panel is point-cloud size 4. 
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Table 3.14 



Integral Dimensions 



Object 



Dimensions 



Experimental Task. 



Reference 



Munsell 
Chip 



Brightness 
Saturation 



Free classification 
Speeded classification 



Handles Imai 09^2) 
Hyman^ Well (1 963) 
Garner & Felfoldy (1970) 



Dot 



Horizontal position 
Vertical position 



Speeded classification Garner &" Felfoldy (1970) 



Ellipse 



Eccentricity size 



Absolute "judgement 



Egeth s ^Pachella (1969) 



Rectangle tengtfi 
Width 



Relative coding 
Absolute judgement 



Dykes £ Cooler (1978) 
Felfold^ (1974) 



Obtuse Height 

Triangle Length of right side 



Free classification 



Somers s Pachella (1978) 
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Table 5,1 



Distribution of Questions for Each of the Different Operating Principles 



Syntax 

■ Adequate Dimensional Gestalt Perceptual Processing Processing 
Discrimin- Structure Qrganiza- Distortion Limitations Priorities 
ability tion 

No, of * 

questions 11" 1 9 1 6 11 



Formal and Semantic 

Horizontal Vertical Schema Surface ■ 

Mapping Happing Availability Compatibility 

No. of 

questions .9 17 4 7 > 



t!o. of 
questions 



Pragmatics 



Contextual Invited 
Compatibility Inference 



\ - 



I 



T&ble 5*2 



Distribution of Questions for Each of the Graphic Constituents 

and 'tfieir Combinations 



Cons ti tuents 

Framework Specifier Labels Background 

Mo* of 

questions ,17 19 28 4 ■ 



C0mbinati6ns of Constituents 



^ No* of 
questions 



Frame - Specifiers Frame - Specifiers - Labels 
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A 



Table 5*? 



possible Outcomes of aiy'JVnalysis of a Graph by Two Analysts 



Analyst 1 



Problem 



No Problem 



Analyst 2 
Problem No Problem 



* 4. 



ERIC 



42o 



Table 5*4 



Results Prom Analysis of Ten Graphs by Two Analysts 

Analyst 2 



Analyst 1 





Problem 


No Problem 


Problem 


58 


9 


No Problem 


18 


705 









9 

ERLC 



4 



2* 



* Table 5*5 

Distribution of Graphs as a Function of the Sampling Scheme Categories 



Content Area 

Physical Life Social ^ General 

Hath Science ■ Science Science Business " Interest 

Wo, of 
Graphs 

Analyzed 10 ^ 11 , 15 13 10 



Audience 

Adu It Se conda ry P ri mary Ge ne r al 

tfo, of 
Graphs 

Ana lyz ed 40 18 7 10 



Publication Format 



General 

Journal Textbook Reading Newspaper Magazine 

Wo, of 
Graphs 

Analyzed 15 P - 37 16 3 4 



Visual Format 



Bar Line Pie Other 

Ho* of 
Graphs 

Analyzed ,22 24 10 19 



Table 5.6 



Proportion of Faults for the Different Sampling Scheme Categories 



Content Area 



Hath 



Physical 
Science 



Life 
Science 



Social 
Science 



Business 



General 
Interest 



Faults/ 
Graph 



1.2 

(10)* 



1.9 
01) 



1.2 
(16) 



1.7 

(15) 



2.8 

(13) 



1 .4 
(10) 



A ^ a nce 



Adult 



Secondary 



Primary 



General 



Faults/ 
Graph 



1.9 

(40) 



1.4 
(18) 



1.4 

(7) 



1.5 

(10) 



Publication Format 



Faults/ 
Graph 



General 



2.1 

(16) 



Journal 



1.4 

(15) 



Magazine Newspaper Textbook 



1.25 

(4) 



0.33 
(3) 



1 .81 

(37) 



Visual Format 



Bar 



Line 



Pxe 



Other 



Faults/ 
Graph 



1 .7 

(22) 



1.5 

(24) 



1.2 

(10) 



2.1 

(19) 



dumber of graphs in parentheses 



Table 5*7 f 



Distribution of Faults Per Question Set as a Function of the Different Levels 

of Analysis and Operating Principles 



(a) 

Levels of Analysis 

Syntax Semantics Pragmatics . Formal 

Proportion 1*5 1*2* 1*0 * KS 

of Faults (39)* (11) (9) (26) 



♦Number of questions in a set are shown in the parentheses* 



(b) 

Operating Principles 



Syntax 

Adequate * Gestalt 

Discrimin- Dimensional Grganisa- 
a&ility Structure tion 



Perceptual Processing Processing 
Distortion Limitations Priorities 



Proportion /f*6 
of Faults 11) 



0 

(1) 



U9 
(9) 



2*0 
(1) 



1*3 

(6) 



1*3 
(11) 



Formal and Semantic 



Proportion 
of Faults " 



Internal 
Mappi ng 

1*0 
(9) 



External 
Mapping 

2*2 
(17) 



Schema 
Availability 

1 *2 
(4) 



Surface 
Compa tibili ty 

1 *1 

(7) 



h Pragmatics 

Contextual Invited 

Compa tibi;ity Inference 

Proportion o*9 . 0*6 

of Faults f {4) (5) 



42, 



■ Table 5.8 



Distribution of Fault Proportions as a Function of 
the r* 'ferent Graptic Constituents 



Constituent 



Background Label Frame Specifier Fra-Spec LA-FR-Spec Mult-Fra 



Proportion 
of Faults 



1 .0 

(4)* 



1.7" 
(28) 



1.0 
(17) 



2,3 
(19) 



2,0 
C3) 



1 ,7 
(4) 



0,3 
(10) 



"number of questions in a set is shown in parentheses* 



0 

ERIC 



42o 
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Table 5*9 

Breakdown of Fault Proportion for Specifier and Frame-Specifier Combination 
in Terms of the Different Operating Principles 

Operating Principle 

Adequate Gestalt Processing Processing 

Internal External Surface Discrimin- Grganiz- Perceptual Limita- Priorities 

Mapping Happing Compatibility ability ation Distortion tions 

Proportion 1*00 0 0 0 0 0 0 0 

of Faults - 0*02 0*37 0*09 0*23 0*02 0*05 0*14 0*07 



o 420 
ERIC 



43^ 




- 43i 

48 



/ 



Distribution of Total Sample of Charts and Graphs Visual Format 



field 



Audience 



Format 



Bay 
- 1 



Line Pie Oth<ar 



B** - 

Mathematics 


Adult 


Journal (MAJ) 


1 


4 




2 


Textbook ( 




2 




1 


General Readinq (MAG) 










Secondary 


Textbook ' (MST) 


4 


4 


2, 


2 


General Reading (MSG) 










Pre -Secondary 


Textbook \ * ' (MPT) 








i 


General Readinq (MPG) 










Physical Sciences 
* 


Adult 1 ■ ■ 


Journal * (PAJ) 


2 


4 






Textbook - (PAT) 




14 






General Readinq (PAG) 


2 


1 






seconds ry 


Textbook (PST) 


3 


4 


1 


4 


General Readinq (PSG) 










Pre ** S e co nda r y 


Textbook (PPT) 


1 


1 






General Reading (PPG) 










Life Sciences 


- 

Adult 


Journal (LAJ> 


13 


15 




4 


Textbook (LAT) 


2 


6 




3 


General Readinq (LAG) 


5 


8 


2 


5 


secondary 


Textbook (LST) 


2 


1 




1 


General Readinq (LSG) 










rtc onaa ry 


Textbook (LPT) 




1 




2- 


General Readinq (LPG) 










Social Sciences 


Adult 


Journal" (SAJ) 


5 


7 




2 


Textbook "(SAT) 


2 


3 




2 


General Ruadinq (SAG) 


2 


2 




2 


-- secondary 


Textbook (SST) 


5 


3 


3. 


2 


General Readinq (SSG) 












Textbook (SPT) 


2 




1 




General Readinq (SPG) 




* 






Business 


Adult 


Journal (BAJ) 


HP 


11 


1 


5 


Textbook (BAT) 


[ 


2 






General Readinq (BAG) 


6 


3 


2 


3 


Secondary 


Textbook (BST) 


1 


5 


2 




General Readinq (BSG) 










Pre-Sccondary 


Textbook (BPT) 










General Reading f BPG) 










/(General Interest 


General * a r\ - 

4o£ 


Newnaper (GIN) 


6 


10 




4 


Magazine (GIM) 


13 


21 


2 


8 


Gor.oral Roadina (GIG) 


■ 6 


11 


1 


1 
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Tabled* Species of visual displays* Examples are in pftrencheses , 



TYPE 



. Intri nsic 
ConL Lmrat ion 



Illustration (diagram of rhouibus) 



Problem Solving 



(diagram used to prove 
theorem) 



Problem Defining (Euclidean experi- 
mental sketch) 



( Cloorplun) 



(pulley diagram 
used to anti- 
cipate movements) 

(Einsteinian 
"thought expert 
ment" image) 



Symbol 



(graj>hK) 



(V^nn diagram) 



(sketch of liilbert 
space) 
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